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ABSTRACT 

Spitzer IRAC selection is a powerful tool for identifying luminous AGN. For deep IRAC data, however, the 
AGN selection wedges currently in use are heavily contaminated by star- forming galaxies, especially at high 
redshift. Using the large samples of luminous AGN and high-redshift star-forming galaxies in COSMOS, we 
redefine the AGN selection criteria for use in deep IRAC surveys. The new IRAC criteria are designed to 
be both highly complete and reliable, and incorporate the best aspects of the current AGN selection wedges 
and of infrared power-law selection while excluding high redshift star- forming galaxies selected via the BzK, 
DRG, LBG, and SMG criteria. At QSO-luminosities of log L2-iokev(ergs s"^)> 44, the new IRAC criteria 
recover 75% of the hard X-ray and IRAC-detected XMM-COSMOS sample, yet only 38% of the IRAC AGN 
candidates have X-ray counterparts, a fraction that rises to 52% in regions with Chandra exposures of 50- 
160 ks. X-ray stacking of the individually X-ray non-detected AGN candidates leads to a hard X-ray signal 
indicative of heavily obscured to mildly Compton-thick obscuration (log A^h (cm~^) = 23.5 ±0.4). While IRAC 
selection recovers a substantial fraction of luminous unobscured and obscured AGN, it is incomplete to low- 
luminosity and host-dominated AGN. 

Subject headings: galaxies: active — infrared: galaxies — X-rays: galaxies 



1. INTRODUCTION 

While supermassive black hole (SMBH) growth and galaxy 
formation were once assumed to proceed independently of 
one another, a new picture is emerging in which common 
triggering and/or feedback mechanisms drive the formation 
and evolution of both SMBHs and their hosts. To deter- 
mine which processes (e.g., secular evolution or major galaxy 
mergers) are primarily responsible for moving a galaxy onto 
the present-day M-a relation, however, we first require a com- 
plete census of luminous AGN activity. 

In X-rays, the typical AGN outshines even the most ac- 
tively star-forming galaxy, and as such, deep X-ray surveys 
provide the most reliable means of AGN selection. However, 
while X-rays penetrate low to moderate columns of obscuring 

^ Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, 
MD 21218; Giacconi Fellow; donley@stsci.edu 

^ Max Planck Institut fur extraterrestrische Physik, Giessenbachstrasse 
1, D-85748 Garching bei Munchen, Germany 

^ Spitzer Science Center, California Institute of Technology, MC 220-6, 
1200 East California Boulevard, Pasadena, CA 91 125, USA 

^ Department of Physics, Massachusetts Institute of Technology, 77 
Massachusetts Ave., Cambridge, MA 02139, USA 

^ Harvard Smithsonian Center for Astrophysics, 60 Garden St., Cam- 
bridge, MA 02138, USA 

^Laboratoire d'Astrophysique de Marseille (UMR 6110), CNRS- 
Universite de Provence, 38, rue Frederic Joliot-Curie, 13388 Marseille 
Cedex 13, France 

^ Steward Observatory, University of Arizona, 933 North Cherry Av- 
enue, Tucson, AZ 85721, USA 

^ National Optical Astronomy Observatory, 950 North Cherry Avenue, 
Tucson, AZ 85719, USA 

^ Instituto de Astronomia, Universidad Nacional Autonoma de Mexico, 
Ensenada, Baja California, Mexico (mailing address: PO Box 439027, San 
Diego, CA 92143-9027, USA) 

University of California, San Diego, Center for Astrophysics and 
Space Sciences, 9500 Oilman Drive, La Jolla, CA 92093-0424, USA 

Institute for Astronomy, University of Hawaii, 2680 Woodlawn 
Drive, Honolulu, HI 96822, USA 

12 UCO/Lick, University of CaHfornia, Santa Cruz, CA 95064, USA 

1^ INAF-Osservatorio Astronomico di Bologna, via Ranzani 1, 1-40127, 
Bologna, Italy 



dust and gas, 2-10 keV X-ray surveys miss a significant frac 
tion of moderately obscured AGN (~ 25% at Mi = 10 ^^ cm~^) 



and nearly all Compton-thick AGN (A^h > 10^"^ cm~^,lTreister 



et al.|2Q04t[Ballantyne eraL][2QQ6l [Tozzi et al.|p 
tits to the cosmic X-ray background, |Gilli et al. 



2QQ6|). F rom 
\\2mi) pre- 
dict that both moderately obscured and Compton-thick AGN 
are as numerous as unobscured AGN at high luminosity (log 
^o.5-2kev(ergs s~^)> 43.5), and are four times as numerous as 
unobscured AGN at low luminosity (log Lo.5-2kev(ergs s~^)< 
43.5). The obscured and Compton-thick AGN missed in deep 
X-ray surveys therefore serve not only as important probes of 
SMBH/galaxy co-evolution, but likely constitute a significant 
fraction of the total AGN population at all luminosities. 

To identify obscured AGN not recovered by X-ray surveys, 
studies have turned to the mid-infrared (MIR). Not only does 
the MIR emission from AGN-heated dust trace the repro- 
cessed radiation absorbed in other wavebands, but it is itself 
relatively insensitive to intervening obscuration. MIR selec- 
tion therefore identifies many heavily obscured AGN, nearly 



half of which are missed in deep X-ray surveys ( Donley et al. 
|2008j ). While many MIR-based selection criteria are there- 
fore designed to specifica lly target h eavily obscured AGN 
(e.g., Daddi et al. 2QQ7b| |Fiore et al. 2008, 2009), Spitzer 
IRAC selection is sensitive to the hot dust signature present 
in > 80-95% of luminous AGN regardless of obscuration 
( |Hao et al.|2"QTQl|201 1). As such, IRAC selection is a poten- 
tially powerful technique not only for identifying the heavily 
obscured AGN missed in the X-ray, but for selecting lumi- 
nous obscured and unobscured AGN when deep X-ray data 
are unavailable. 

The IRAC color-color cuts most co mmonly used for AGN 
selection we re defined by| Lacy et al.| (f2004, 2007)) and| Stern| 
et al.|(|2003]) using shallow IRAC data to which additionafBux 



cuts at 8 /im, 24 jim or R-ban d served to reject all but the 
brightest sources (^Sg um ^ 1 inJy ' ' ' ~ 

and ^s.o/xm ^ 76 /iJy, Stern et al 
[eTaTlpnOT] )). While these mitiat 



Lacy et al.| ( |2QQ4] ); R < 21.5 
2UU^p ;^24/im >^ m Jy, ' Lacy| 
1 color cuts therefore etiec- 
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lively select luminous AGN in samples containing onl y AGN 
and bright, low-redshift star-forming galaxies (see also |Sajina 
[Ft al.j|2005 ), they extend into regions of IRAC color space 
populated by moderate to high-redshift (z ^0.5) star-forming 
galaxies in the deep IRAC surveys now av ailable ac ross many 
cosmologica l fields (e.g., [Barmby et £0^ 2006^ ,Donley et al. 
2007 , 2008] ICardamone et al | |2008 ; Yun et aLpOOSH 
et al ||2009| " ~ ~ 

which ident 



1 Brusal 



Park et al.||2010| ). IRAC power-law selection, 
ties only the most robust of the IRAC color- 
selected AGN, has therefore been ad opted to minimize con 
tamination by normal ga laxies ( Alo nso-Herrero et al.||2006 



g by ^ 

[Donley et al.|2007[|2008| ). This technique, however, depends 
on both the IRAC photometry and the often-underestimated 
photometric errors, adding a degree of complexity not present 
in simple color-color cuts. 

Our understanding of the MIR source population has in- 
creased substantially in recent years, thanks in part to the 
Spitzer IRS spectra now available for large samples of both 
local and high redshift sources, and in part to the availabil- 
ity of deep IRAC data in multi- wavelength survey fields. It 
is therefore time to revisit the IRAC selection of AGN and 
redefine the selection criteria for use in deep survey fields. 
To do so, we focus on the 2 deg^ Cosmic Evolution Sur- 



vey (COSMOS, [Scoville et a l. 2007; Koekemoer et al.|2007| ). 
Unlike the deeper Great Observatories Origins Deep Survey 
(GOODS) fields, the combined size and depth of COSMOS 
provides large samples of both luminous AGN and normal 
IRAC-detected star- forming galaxies out to z ~ 3. Further- 
more, intensive spectroscopic follow-up campaigns and pho- 
tometric redshifts tuned to both AGN and normal galaxies 
give near complete redsh ift constraints on both the AGN and 
star-forming populations fLilly et al. ' 2007 [[Trump et al.|2009 



[Salvato et al. 2009; Ilbert et al. 2009 ). 

This paper is organized as follows. In §2, we provide an 
overview of IRAC AGN selection and discuss which AGN 
should reliably be selected by this method. We present the rel- 
evant COSMOS datasets in §3, and we discuss in §4 the IRAC 
power-law selected AGN in COSMOS. In §5, we present 
the properties of the COSMOS XMM sample, and in §6, we 
summarize the XMM sample's trends in IRAC color space. 
We then investigate the properties of the full IRAC sample 
in §7 and of high-redshift star- forming galaxies in §8. In 
§9, we present the revised selection criteria, and we sum- 
marize the results in §10. Throughout the paper, we as- 
sume the following cosmology: (r^ni,^A,^o)=(0-27, 0.73, 
70.5 km s~^ Mpc~^), and we quote all magnitudes in the AB 
system unless otherwise noted. 

2. IRAC AGN SELECTION: AN OVERVIEW 

The premise behind IRAC AGN selection is illustrated in 
Figure 1 , where we plot composite AGN+starbu rst SEDs con- 
structed using the QSOl and M82 templates of IPolletta et alj 
([2008 ). The SEDs of normal star- forming galaxies display a 
prominent dip between the 1.6 jim stellar bump and the long- 
wavelength emission from star-formation heated dust (Tdust ~ 
25-50 K). Dust near an AGN's central engine, however, can 
reach a sublimation temperature of r^ust ~ 1000-1500 K and 
thus radiate into the near- infrared (NIR). If the AGN is suffi- 
ciently luminous compared to its host galaxy, the superposi- 
tion of black body emission from the AGN-heated dust will 
fill in the dip in the galaxy's SED and produce a red, power- 
law like thermal continuum across the IRAC bands. As shown 
in Figure 1, while the UV-optical SEDs of obscured and unob- 
scured AGN bear little resemblance to one another, this char- 
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Fi g. 1. — Composite SEDs constructed using the QSOl and M82 templates 
of PoUetta et al. ( 2008 ), scaled to give 1-10 /im AGN contributions of 0% (red 
in the online version) to 95% (purple in the online version). The final SEDs 
have been normalized at 1.6 /im. In the lower panel, we apply an extinction 
of Av = 2 to the QSOl SED using the Draine ( 2003) extinction law. The four 
IRAC bands at z = are shaded. While luminous unobscured and obscured 
AGN have very different UV-optical SEDs, luminous AGN should display a 
red MIR power-law SED regardless of obscuration. 



acteristic MIR spectral shape should remain so long as the 
obscuring medium is optically thin at NIR-MIR wavelengths. 

The optical thickness of the obscuring medium at NIR-MIR 
wavelengths depends on the structure of the torus. A smooth 
and geometrically thick torus will obscure even the MIR emis- 
sion from Type 2 AGN, leading to a larg e offset in MIR lumi - 
nosity between Type 1 and Type 2 AGN ( [Pier & Krolik| 1993) . 
While no studies have observed the degree of MIR anisotropy 
predicted by smooth torus models. Type 2 AGN do appear to 
be ~ 3 - 5 fainter in the MIR than Type 1 AGN when the radio 
continuum lumi nosity is used to constrain the AGN's intrin- 
sic lu minosity (Heckman" 19951 [Buchanan et al.| 2006 [ |Haaj 
et al.|[2 008; Leipski et al. 20ld). However, no offset in MIE 
luminosity is observed when the AGN luminosity is instead 



normalized by the absorption-corrected X-ray luminosity and 
when the AGN's contribution to the MIR emission is isolated 
either spatially or spectrally ( |Lutz et aL]|2004t [Gandhi et al. 
2009| ). This suggests that the radio-based comparison may 
be biased, at least in part, by the contribution from beamed 
synchrotron emission in Type 1 AGN ( Cleary et al. 2007 ). 

While the observed degree of MIR anisotropy therefore 
remains an open question, a low optical depth is a natu- 
ral consequence of clumpy torus models, which predict only 
weak NIR-MIR anisotropics that essentially disappear by A ~ 
12/im (e.g. Nenkova et al. 2008a b ). Even at 1 /im, the clumpy 
torus model of Nenko va et al. | ( 2008b ) predicts an edge to pole 
flux ratio of < 5 for a cloud optical depth of ry = 60. For sam- 
ples of Type 1 and Type 2 AGN averaged across all viewing 
angles, we might therefore expect only small systematic off- 
sets in the observed MIR emission, and only at the lowest 
rest- frame wavelengths probed by the IRAC bands. We will 
investigate this issue in more detail in Sections 6 and 9. 
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Fig. 2. — Predicted z = 0-3 IRAC colors of AGN/galaxy composite SEDs in |Lacy et aLH2004||2007 ) color space, where the A GN fraction is defin ed between 
1 and 10 /im. The star-forming templates represent the ULIRG IRAS 22491 (square, [PoUetta et al. 2008), the starburst M82 (star, Polletta et al. 2008), a normal 
star-forming spiral galaxy (triangle. Dale & Helou 2002), and an elliptical galaxy (cir cle, Polletta et al. 2008), where large symbols mark each family of purely 
star-forming templates at z = 0. The AGN template is the QSOl template of Polletta et al. (2008 1. Additional extinctions of Ay = 0-2 and Ay = 0-20 are applied 
to the star-forming and AGN components, respectively. The wedge is the AGN selection region of Lacy et al. (2007), and the line represents the power-law 
locus from ot = -0.5 (lower left) to o; = -3.0 (upper right). While the templates of purely star-forming galaxies avoid the power-law locus, they enter the current 
AGN selection region at both low and high redshift. As the AGN's contribution to the MIR emission increases, the SEDs move inwards and redwards toward the 
power-law locus. 
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Fig. 3. — Same as Figure 2, but for | Stern et al.H2005} color space. The wedge is the AGN selection region of |Stern et"ar]j2005) . 
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2.1. IRAC colors of AGN/star-forming composites 

To illustrate the expected IRAC colors of galaxies with 
varying AGN contributions, we plot in Figures 2 and 3 the < 
z < 3 IRAC colors of composite SEDs constructed from the 
QSOl, elliptical (e ll2), starburst (M82) , and ULIRG (IRAS 
22491) templates ofPolletta et al." (20081 and the s tar-forming 
template of Dale & Helou ( 2002 ), plotted in both |Lacy et al.| 
02OO4) |2007| ) and Stern et al. (2005 ) color space. To allow 
for the effects of reddening, we apply additional extinctions 
of Av = - 2 and Ay = - 20 to the star-forming and AGN 
component s, respectively, using thep raine (2003 ) extinction 
curve (e.g., |Hickox et al.|2007[[Houet al. 2011 ). 

We also show in Figures 2 and 3 the AGN s election wedges 
of|Lacy et al.'(2007) and Stern et al.|(|20051) along with the 
IRAC power-law locus (the line on which a source with a 
perfect IRAC power-law SED would fall), where circles de- 
note power-law slopes of a = -0.5 to -3.0 (/^^ (xv^). While 
the power-law locus itself extends to bluer slopes, luminous 
AGN are expected to display red slope s of a < -0.5 ( Alonso- 
Herrer o et al. 2006 ; Donley et al. 2007|). 

In both Lacy et al. ( 2004, 2007) and [Stern et "aT] ( [2005] ) 
color-space, purely star-forming templates generally avoid the 
region surrounding the power-law locus, at least out to z ~ 3 
(the potential for contamination by higher redshift galaxies 
will be discussed in §8). As the AGN becomes increasingly 
dominant, the IRAC colors then shift inwards and redwards 
towards the power-law locus. Generally speaking, the more 
luminous the AGN, the redder the IRAC colors. A source's 
precise location in IRAC color space, however, will depend 
not only on the relative AGN/host contributions, but on its 
redshift, the reddening of its host and AGN components, and 
the host galaxy type. 

2.2. IRAC colors of purely star-forming galaxies 

While the purely star-forming templates shown in Figures 
2 and 3 lie outside of the powe r-law region in IRAC color 
space, they enter the [Lacy et'aLl ( 2004, 2007]) and lStern et al. 
( 2005j) AGN selection wedges at both l ow and high redshift 
(Barmby et al. |2006t [Donley et al.|2007|[2008 ). Thanks to re- 
cent Spitzer IRS spectroscopic campaigns in fields with deep 
IRAC data, however, we need no longer rely solely on tem- 
plates to constrain the IRAC colors of purely star-forming 
galaxies. 

The IRAC colors of Spitzer IRS sources classified as pure 
starbursts are plotted in Figure 4 (see Table 1 for details). 
While each IRS study uses slightly different criteria for iden- 
tifying purely star- forming galaxies, we standardize this def- 
inition by requiring that EW(6.2 /im)> 0.4 when t he 6.2 fim 
PAH feature lies within the observed bandpass ( [Weedman 
^Houck[|2009 ) and EW(11.3 /im)> 0.8 when it does not 
( [Dasyra et a l.|2009 ). We exclude any source flagged as having 
blended IRAC photometry, and plot for comparison the col- 
ors of sources that have not been observed with IRS, but that 



meet the 1. 5 < z < 3 star-forming IRAC criteria of Huang 
^taLl ( |2QQ9] ). 

The IRAC colors of the IRS star-forming galaxies are gen- 
erally consistent w ith the predictions from Figures 2 and 3. In 
Lacy e t aP (120041 12007 ) color space, low redshift galaxies lie 
predominantly above the AGN wedge, though some fall just 
within the AGN selection region. At 0.5 < z < 1.5, they fall 
both to the left of and within the AGN wedge, and at z > L5, 
the vast majority of star-forming galaxies lie within the AGN 
selection region. In [Stern et al. ([2005) color space, most low 



and moderate redshift star-forming galaxies fall below the se- 
lection region, and many of the high-redshift galaxies lie to 
the left of the AGN wed ge. However, a significant fraction of 
the [Huang et al. ( 2009) ) 1.5 < z < 3 star-forming candidates 
scatter into the AGN selection region, as do several moderate 
and high-redshift IRS -selected starbursts. 



With the exception of a z = 0.73 ULIRG from Dasyra et al. 
( [2009^ shown by a cross near the a ~ -1.0 power-law lo- 
cus), the star-forming galaxies lie well away from the power- 
law locus, especially at a < -1.0 and in Lacy et al. (2004) 
color space. The IRS spectrum of this source displa ys strong 
PAH emission with EW(11.3/im)= 1.43 ±0.07 (Dasyra et al. 



2009), and a low S/N optical spectrum shows strong [Oil], 
H/J, and [QUI] emission but no obvious high-ionization lines 



( [Papovich et al.|2006 ). At z = 0.73, however, neither the op- 
tical nor MIR spectral coverage extend into the 1-5 fim rest- 
frame regime sensitive to AGN heated dust. Given the avail- 
able data, it is therefore unclear whether this source's power- 
law like IRAC SED is due to a contribution from AGN-heated 
dust, or whether it is a true star-forming interloper. Never- 
theless, it is clear from Figures 2, 3, and 4 that while the 
current AGN selection regions inadequately separate AGN 
from moderate- to high-redshift star- forming galaxies, lumi- 
nous AGN should occupy a well-defined region of color space 
with minimal star-forming contamination. To test these pre- 
dictions and better constrain the region of IRAC color space 
that can reliably be used for AGN selection, we turn to COS- 
MOS. 

3. COSMOS DATA AND SAMPLE SELECTION 

The 2 deg^ COSMOS field is characterized by both deep 
and wide coverage across much of the electromagnetic spec- 
trum. We focus here on the XMM-Newton and Spit zer IRAC 
datasets that cover the full survey area ( Hasinge r et al.|2007 



Sanders et al.|2007|), as well as t he deeper Chandra data avail 



able over the central 0.9 deg^ ( [Elvis et al.[[2009| ). We also 
make use of the intermediate and broad-band optical-NIR data 
used to construct photometric redshift s for the X-ray-detected 
and nor mal galaxy populations (Capak et al."2 007] [Salvato 
et al |2009i Ilbert et al.,2009, ,Salvato et al.,2011| Capak et al. 
2011, in prepT 

3.1. IRAC 

The S-COSMOS IRAC data cover the full 2.3 deg^ COS- 
MOS field to 1200s depth, reaching 5a sensitivities of 0.9, 
1.7, 11.3, an d 14.6 //Jy in the 3.6, 4 .5, 5.8, and 8.0 /im bands, 
respectively ("Sanders et al.[|2007] ). The IRAC sample used 
here contains the 2625 1 non-flagged IRAC sources that lie in 
the 1 .73 deg^ region of C OSMOS with high-quality unmasked 
optical-NIR data ( filbert et al.[[2Q09| [2010^ , have an opti- 
cal/NIR co unterpart within a 2' search radius, 
( [2009] ) or 



as stars by Ilbert et al. 



, are not flagged 



McCracken et al.| (|2010|), 



have a photometric or spectroscopi c redshift, and meet the 5 a 
sensitivity limits of Sand ers et al.| ("2007 ) in each of the four 
IRAC bands. The aperture-corrected IRAC p hotometry was 
calculated in 1.9'^ apertures using SExtractor ( Sanders et al. 
[2007] ). 



3.2. XMM 



Our primary sample of X-ray select ed AGN is drawn from 
the ^ 40 ks XMM-COSMOS survey ([Hasinger et al.[[2007 
"Cappelluti et al.|[200 9"i Brusa et al.[j 2 
dra data of lElvis et al. ( 2009i) reach" 



2_0l0V While the C/2(3^ 
greater depths over the 
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TABLE 1 

Pure Starbursts with IRS Spectra 



Number 


Mean z 


Sample 


Fields 


Selection 


16 


0.06 ±0.04 


iHouck etal.li2007 


NDWFS 


/24 > lOmJy 


182 


0.13±0.10 


iSargsyan & WeedmanT2009k SWIRE, NDWFS 


z<0.5 starbursts 


22 


0.71 ±0.06 


tu et ai. (20i0i _ 


' COSMOS 


/24 > 0.7 mJy , z ~ 0.7 


24 


0.74 ±0.22 


'^Dasyra et al. ( ZU09 


SFLS 


/24 > 0.9 mJy 


4 


0.75 ±0.29 


iJrana et al. ( ZUU8^ 


NDWFS 


70 yum selected, optically-faint 


5 


0.86±0.14 


^arran et al. ( 2UU9 


SWIRE 


70 /im selected, optically-faint 


11 


1.63±0.12 


i^arran et ai. ( zuu8 


SWIRE 


IRAC bump selected 


6 


1.75 ±0.60 


Fope et al. ( 2UU8F 


GOODS-N 


SMGs 


4 


1.82±0.10 


[Weedman et al. ( 2Ul 


6l SWIRE 


IRAC bump selected 


6 


2.05 ±0.48 


nHuang et al. ( 2uu9 


r EGS 


IRAC bump selected 


2 


2.59 ±0.63 


|Menen"dez-Uelmestre efar.U2009| SWIRE 


SMGs 



O 
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Weedman et al. 2006 
Houck et al. 2007 
Brand et al. 2008 
Farrah et al. 2008 
Pope et al. 2008 
Dasyra et al. 2009 



Farroh et ol. 2009 z < 0-5 

Huang et al. 2009 0.5 < z < 1.0 

Menendez-Delmestre et al. 2009 1.0 < z < 1.5 

Sargsyan et al. 2009 1.5 < z < 2.0 
Fu et al. 2010 
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Fig. 4. — IRAC colors of pure starbursts, as determined via Spitzer IRS spectroscopy (large symbols). The small grey points represent sources that have not 
been observed with IRS, but that meet the 1.5 < z < 3 star-forming IRAC criteria of Huang et al. ( 2009 1. The wedges show the current AGN selection regions 
of (Lacy et al. 2007 left) and ( Stern et al.|2005] right), and the line shows the power-law locus from a = -0.5 to o; = -3.0. A significant fraction of purely 
star-forming galaxies lie within the current AGN selection regions, particularly at high redshift. 



central 0.9 deg^ and will be used later in the paper, the larger 
area of the XMM survey better samples the population of lu- 
minous yet low space density AGN, resulting in roughly equal 
numbers of Seyfert galaxies (log Lx(ergs s~^)< 44) and QSOs 

(log Lx(ergs s-^)> 44). 

The full XMM-COSMOS sample of |Brusa et al.| ( [2QT0 



contains 1797 sources, 93% of which have a spectroscopic 
or photometric redshift indicative of an extragalactic source. 
(Of the remaining sources, 5.5% are Galactic stars, and 1.5% 
lack a redshift estimate). We restrict our XMM sample to the 
sources that lie within the 1.73 deg^ region of COSMOS with 
deep, uniform, and unmasked optical-NIR data filbert et al. 
|2009 ), have a reliable optical/NIR counterpart inf Brus a et al. 
12010), are not fla gged as stars i n Brusa et al. (2010) 



|et al. (2009) , or McCracken et al., (2QlQ), have a photometric 
or spectroscopic redshift, and haveTiigh quality IRAC coun- 
terparts that meet or exceed the COSMOS 5a se nsitivity in 
each of the four IRAC bands ( [Sanders et al.|2007] ). Applying 
all but the IRAC detection criterion gives a sample of 1183 



sources, 12% of which are excluded in our final sample of 
1039 XMM sources with high significance IRAC counterparts. 

Of the XMM sour ces, 63 % have a spectroscopic redshift 
as compiled by Bru sa et al.| (2010 ) and the remain ing 37% 
have a n AGN- specific photometric redshift from , Sal vato et al. 
P009| ). To constrain the IRAC properties of the XMM sam- 
ple, we use the IRAC flux densities from I lbert et aL] ( |2QQ9 ) 
(as given by| Brusa et al. |2QlQ), which have been re-extracted 
at the positions of the XMM sources to minimize the effects 
of blending. To per mit comparis on with the 3^' optical-NIR 
aperture photometry, |Ilbert et al.| |2009 ) multiplied all of the 
IRAC flux densities by 0.75. Here, we undo this correction to 
recover the total IRAC flux densities. 

3.3. Chandra 

The COSMOS Chandra coverage reaches 160 ks depth 
over 0.5 deg^ and 80 ks depth over an additional 0.4 deg^ 



over aeg"- ana ks aeptn over an aaaitional U.4 aeg"- 
( Elvis et al.|2Q09| ). The full catalog of Chandra sources con- 
tains 1761 sources , 1010 of wh ich have optical/NIR and IRAC 
counterparts from |Elvis et al.| ( [2009) and Civano et al. (2011, 
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in prep.) that meet the criteria described above. Of these, 
56% are also XMM sources, and 61% have spectroscopic red- 
shifts. For the remaining Chandra sou rces, we adopt t he 



AGN-specific photometric redshifts of Salvato et al. (2011 



3.4. Optical-NIR Photometry and Redshifts 

Optical and NIR photometry a re taken from the in t erme 
diate and broad-band catalo gs oflCa pak et al.| (|2007| ), lllbert 



eraL] ( [20Q9l ), |McCracken et al. ( 2010 ) ana |Capak et al.| ( [2QTT|: 
We apply the band-specific systematic offsets given by 11- 



bert et al^ ( |2009 ), which correct for errors in the zero-points 
and mininuze the systematic scatter in the photometric red- 
shifts, and we aperture correct the photometry using the band- 
independent auto-offset aperture corrections. 

High confidence spectroscopic redshifts and spectral clas- 
sifications were compiled from the Sloan Digital Sky Sur- 
vey (SDSS, Kauffmann et al. 2003 ; Adelman-McCarthy et al. 
|2006j, the Mag ellan/IMACS and MMT observin g campaigns 



of|Prescott et a l. ( 2006 ), Trump et al. ( 2007|, and Trump et al. 
( |2009| ), zCOSTvlOS ( |Lilly et al.,2 007), and subsequent Keck 
runs. Photometric redsh ifts for non X-ray sources are taken 
from Ilbert et al. (2009 ), whereas AGN-specific photometric 
redshifts for thj XMM an d Chandra samples w ere drawn from 
Salvato etaL| ( |2009t and |Salvato et al.|p011| ), respectively. 



4. POWER-LAW SELECTION: A STARTING POINT 

IRAC power-law selection identifies sources that lie near 
the power-law locus in color space, and therefore selects only 
the most secure AGN-dominated candidates fr om the IRAC 
AGN wedges ( e.g., Alonso-Herrero et al. 2006] |Donley et al.| 
|2QQ7i 2008; P ark et al.||2010| ). For a source to be selected 
as a power-law AGN, a line of slope a < -0.5 must provide 
a good fit to its logarithmic IRAC photometry (/^^ ex z^", or 

= alogv + b). The goodness of fit is measured by the 
probability, P^, with a limit of P^>0.1 (Donley et al. 2007, 
|2008). ' ' ' 

IRAC power-law selection therefore depends not only on 
the IRAC flux densities used for color-color selection, but also 
on the IRAC errors typically derived using SExtractor. Unfor- 
tunately, SExtractor tends to underestimate photometric un- 
certainties, as it does not account for the correlated sky noise 
present in m osaiced data (e.g., Gawis er et al.||2006[ |Barmby| 
et al.|2008] ). Furthermore, the cataloged measurement errors 



do not account for the IRAC calibration uncertainty, which is 
at best 3% and is at worst 10% when the sub-pixel response 
and the array-location-dependent changes in pixel solid an- 



gle and spe ctral response are not taken into account (Reach 
|et al. 112005] ). The SExtractor errors can therefore be as low 
as Afjy/fjy = 0.1% for our IRAC sample, with the brightest 
IRAC sources exhibiting the most underestimated errors. 

Using such highly underestimated errors, it becomes nearly 
impossible to obtain a statistically acceptable fit to a given 
SED, even when the SED appears to closely match the pho- 
tometry. As an example, consider the 25 XMM sources that lie 
in the immediate vicinity (within 0.025 dex) of the a = -1.0 
power-law locus. Using the flux densities and errors from the 
official COSMOS IRAC catalog, the power-law criterion is 
met by only 30% of the fainter half of the sample and by none 
of the brighter half of the sample. 

This effect is further illustrated in Figure 5, where we plot 
the IRAC colors of the COSMOS XMM and IRAC samples 
and identify the sources selected as power-law galaxies using 
the cataloged errors and those selected as power-law galax- 
ies when additional uncertainties of 3%, 5%, 10%, and 15% 



are added to the cataloged errors. To quantify the dependency 
of power-law selection on the IRAC errors, we define a box 
around the power-law locus in |Lacy et al.|p004"l|200 7|) color 
space that encloses nearly all of the XMM sources with red 
(a < -0.5) IRAC colors (see Figure 5). If we were to se- 
lect IRAC power-law galaxies using the cataloged errors, we 
would recover only 5% of the XMM sources in the power-law 
box. Adding additional uncertainties of 3%, 5%, 10% and 
15% to the IRAC errors increases this fraction to 41%, 69%, 
96%, and 98%, respectively. 

Because the sources in the full IRAC sample tend to be 
fainter than those in the XMM sample, they display more scat- 
ter in IRAC color space. Nonetheless, we observe a similar 
trend in power-law selection: 12%, 40%, 62%, 91%, and 96% 
of the sources in the power-law box are selected as power-law 
galaxies using the cataloged errors and additional uncertain- 
ties of 3%, 5%, 10% and 15%, respectively. 

Adjustments to the SExtractor-derived IRAC errors are 
clearly required to identify complete samples of AGN that lie 
near the power-law locus in color space. In previous work, 
we have assumed an overall IRAC calibration uncertainty of 
10% to recover the majority of X-ray sources with red IRAC 
colors ( Alon so-Herrero et al.|2006l|Donley et al.|2007| |2008|). 
While this likely overestimates the true errors on the flux, this 
assumption facilitates the selection of sources with small de- 
viations from a perfect power-law SED. As shown in Figure 
5, however, these red, power-law dominated AGN occupy a 
weU-defined region of |Lacy et al. (2004, 2007 ) IRAC color 
space. Power-law galaxies could therefore be selected on the 
basis of their IRAC colors alone, thus eliminating the depen- 
dence of power-law selection on poorly constrained photo- 
metric errors and assumed uncertainties. 

In contrast, power-law gal axies canno t be cl eanly identi- 
fied using color-color cuts in [Stern et al.| ( |2005] ) color-space. 
This increased scatter arises from the use of adjacent bands 
(3.6/4.5 /im and 5.8/8.0 /im), which minimizes the wave- 
length baseline, increases the sensitivity to small variations in 
the SEDs, and groups together the two low-sensitivity IRAC 
channels (5.8 and 8.0 /im, see also [Donley et al.|2007||2008 



Richards et al. 2009 ). We therefore focus below on the [Lacy 



[et al. (2004, 2007 ) representation of IRAC color space, al 
tEbugh we will continue to show relevant plots in both repre- 
sentations to motivate the need for new selection criteria. 

We take as a starting point for these revised IRAC color- 
color cuts the a < -0.5 power-law box defined above. Not 
only does this region enclose nearly all XMM sources with red 
IRAC colors, but it also tightly encloses the AGN-dominated 
templates shown in Figure 2. While this power-law box 
should therefore be highly complete, at least to luminous 
AGN that dominate the light from their hosts, high-redshift 
star-forming galaxies may enter the bluest portions of the 
power-law box (see Figure 4). In the sections that follow, we 
therefore use the COSMOS XMM, Chandra, and IRAC sam- 
ples, as well as samples of high-redshift galaxies, to further 
refine this revised AGN selection region and to quantify its 
completeness and reliability. 

5. XMM SAMPLE: PROPERTIES 

To constrain the efficiency and completeness of IRAC se- 
lection and test for any trends in AGN properties across IRAC 
color space, we first determine the intrinsic obscuration and 
X-ray luminosity of the COSMOS XMM sample. 
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Fig. 5. — IRA C color-color diagrams for the XMM (left) and IRAC (right) samples in COSMOS, where the top and bottom plots show |Lacy et al.H2004||2007t 
and Stern et alj|2b05 1 color- space, respectively. Solid circles (red in the online version) indicate those sources that would be selected as a < -U.S power-law 
galaxies using the IRAC flux densities and errors from the official COSMOS catalog, and squares (gold in the onli ne version) indicate the additional power-law 
galaxies selected when the IRAC calibration uncertainty of 3% is added to the IRAC errors (Reach et al.|20051k. Likewise, (green) triangles, (blue) upside- 
down triangles, and (cyan) stars indicate the additional power-law galaxies selected when uncertainties of 5%, 10%, and 15% are added to the cataloged errors, 
respectively. To select complete samples of AGN that lie near the power-law locus in color space but that may exhibit small deviations from a perfect power-law 
SED, a 10% error on the flux must be assumed. 



5.1. AGN Type: Optical and X-ray Classification 

AGN can be divided into Type 1 (unobscured) or Type 2 
(obscured) classes using a variety of criteria, including opti- 
cal spectral classification, optical SED fitting, IR to optical 
flux ratios. X-ray spectral fitting, and X-ray hardness or flux 
ratios (for a disc ussion of the ag reement between classifica- 
tion schemes, see |Trouille et a l.|20Q9 ). Here, we focus on two 
methods. For the first, we adopt the approach of |Brusa et al. 



(2010 ) and use the optical spectra classificatio n when avail 
able and the best-fit broad-band SED type fromjSalvato et al. 



(2009) otherwise, and refer to the resulting classes as "opti- 
cal Type 1" and "optical Type 2". Of the XMM sources with 
Type 1 and Type 2 spectra, 85-90% are also best-fit by Type 1 
and Type 2 SEDs, respectively, indicating that these two vari- 
ations on optical classification are well-matched. Using this 
approach, we find an optical Type 2 fraction of 58% for the 
XMM sample. 

For the second approach, we turn to the X-ray and use each 
source's redshift, hard to soft X-ray flux ratio, and an assumed 
intrinsic X-ray photon index of F = 1.8 ( |Tozzi et al.||2006| ) 
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to estimate its intrinsic column density, A^h- We then group 
sources into "X-ray Type 1" and "X-ray Type 2" classes us- 
ing a cut of A^H = 10^^ cm~^. For the brightest sources in our 
sample, we compare our estimated column with that derived 
from X-ray spectral fitting (Mai nieri et al.|2007|| ). For the 24 
sources with measurable columns from both techniques, we 
find a typical absolute scatter of log A^h (cm~^) = 0.28 and 
a mean offset of Alog A^H(cm~^)= 0.02. The agreement be- 
tween our simple flux ratio method and the full spectral fitting 
is therefore generally good. 

Using the flux-ratio-derived column density, we measure X- 
ray types for ~ 3 /4 of our sample and find an X-ray Type 2 
fraction of 52%. Of the remaining sources, 90% are detected 
in only the soft band but have upper limits on A^h that are 
potentially consistent with the Type 2 class (due to the com- 
parably low sensitivity of the Chandra hard band), and 10% 
are detected in only the hard band but have lower limits on 
A^H that are potentially consistent with the Type 1 class. This 
leads to ambiguous classifications. 

Comparing the X-ray and optical types, we find that 65% 
of the optically-classified AGN have matching X-ray types 
when the latter is available. While this fraction is far from 
100%, 20-30% of AGN have mismatched optical and X- 
ray types even when classifications are made using deep X- 
ray data and optical spectra, an apparent discrepancy whose 
proposed explanations include spectral variability, host galaxy 
dilution, and disappearance of the br oad-line region in AGN 
accreting at L/ LEdd ^ Q -Ql (e.g., Tozzi et al.l2006UPage et al. 
2006 i Cacciani ga et al.|2 007 ; Trouille et al. 2009 ; Trump et af 
2009i|2011b|a| . To further complicate matters, observational 
scatter in the measured X-ray fluxes, when combined with the 
strict dividing line in column density between X-ray Type 1 
and Type 2 sources, leads to discrepant X-ray types for 15- 
30% of the sources in our sample detected by both Chandra 
and XMM. 

5.2. Intrinsic X-ray Luminosity 

For the 70% of XMM sources in our sample with a hard- 
band detection, we estimate the intrinsic (e.g. absorption- 
corrected) rest- frame 2-10 keV luminosities using the column 
densities estimated above and the observed 2-10 keV fluxes. 
While we can place only lower limits on the columns of the 
13% of sources that lack soft-band detections (with a median 
value of log A^H(cm~^)> 22.7), their estimated intrinsic lumi- 
nosities would increase by an average of only '^0.4 dex if 
all such sources were Compton-thick with log A^H(cm~^)= 24 
(and by only ~ 0.1 dex if log A^H(cm~^)= 23.5). In con- 
trast, because of the comparably low sensitivity of the hard 
X-ray band, the majority (77%) of sources detected only in 
the soft band have column density upper limits in excess of 
log A^H(cm~^)= 22, and 24% have upper limits greater than 
log A^H(cm~^)= 23 (assuming an intrinsic X-ray spectrum with 
F = 1.8). Such high columns can translate into order of mag- 
nitude corrections to the intrinsic luminosity estimated from 
the observed soft-band flux, so we do not attempt to estimate 
intrinsic luminosities for the 30% of XMM sources detected 
only in the soft band. 

All of the XMM sources with a hard band detection have 
both optical and X-ray classifications, with optical and X-ray 
Type 2 fractions of 57% and 58%, respectively, and 68% have 
spectroscopic redshifts. For sources with a measurable col- 
umn or those with only a hard-band detection, we estimate 
the absorption-corrected rest-frame luminosity by again as- 



suming an intrinsic F of 1.8. For sources with no measurable 
column (i.e., those that are softer than our assumed F of 1.8), 
we use the Galactic absorption-corrected hard to soft flux ra- 
tio to calculate the observed photon index, F, which we then 
use to determine the rest-frame luminosity. 

5.3. Broad-band SEDs 

Before discussing the IRAC colors of the XMM sample, we 
examine the median SEDs of sources binned by X-ray or op- 
tical type and X-ray luminosity. Doing so sheds light on the 
nature of AGN of different classes, and also illustrates the 
inherent differences and biases in the classification schemes 
which will affect our later interpretation. 

We plot in Figure 6 the best-fit templates to the median 
SEDs of the X-ray and optical Type 1 and Type 2 populations, 
as a function of intrinsic X-ray luminosity. To create these 
templates, we shift each individual SED into the rest- frame 
and normalize it to 1 /im (the approximate location of both 
the inflection point seen in luminous AGN and the midpoint 
of the stellar bump prominent in lower-luminosity AGN). We 
then calculate the median SED in bins of log AA(A) = 0.1 for 
all bins containing at least 10 points. This procedure gives a 
low-resolution median SED, albeit one with occasional gaps 
in wavelength coverage. To provide a clearer representation 
of the median SEDs, we then fit each median SED with all 
possible combinations of one of six AGN templates (Seyl.8, 
Sey2, QSOl, BQSOl, TQSOl, QS02) and one of 14 star- 
forming templates (comprising all elliptical and spiral tem- 
plates as well as the starburst/UL IRG templates of N6090, 
M82, Arp220, and 122491) from Polletta et al.|( [2Q08| , choos- 
ing as the final fit the template combination with the lowest 
reduced chi-squared. To account for obscuration, we allow 
for independent reddening of t he AGN and star- forming tem- 
plates using the praine ( 2003| ) extinction law. 

The SEDs shown m Figure 6 should be interpreted as rough 
guides to the overall behavior of the XMM sources. In general, 
they are far more highly sampled in the UV/optical than in 
the MIR, and they do not reflect the intrinsic variation seen in 
any given class of AGN (for more detai l ed SED analyses, se e 
(|20Q9| ), [Hi5"graLl ( |20TQ| ), |Lusso et al.| ( |2CTTq1 ), 
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Trump et al.| ( |2011al ), Lusso et al. (2011, in prep.), and Elvis 
et al. (2011, in prep.)). That said, they clearly illustrate the 
redder MIR colors of more luminous AGN (see also Figures 
1, 2, and 3) and confirm that this trend is relatively insensitive 
both to obscuration and to the classification scheme (X-ray vs. 
optical). 

5.3.1. Differences between X-ray and Optical Classification 
Schemes 

While the MIR SEDs plotted in Figure 6 are relatively in- 
sensitive to the obscuration scheme. X-ray and optical AGN 
classification returns samples with different UV-optical SEDs. 
This apparent discrepancy can be attributed in part to the 
physically distinct samples identified by the two criteria, and 
in part to selection biases. For a source to be classified as "op- 
tical Type 1", it must display either broad optical emission 
lines or have an SED best fit by an unobscured quasar tem- 
plate relatively flat in uFy. All remaining sources, including 
those with narrow optical emission lines, those with no emis- 
sion lines yet X-ray lumi nosities exceeding 2 x 10"^^ ergs s~^ 
(e.g., optically duU AGN, |Moran et al.|200"2l|Comastri et al. 
|2002j ), and those lacking optical spectra yet best-tit by a 
galaxy-dominated template are classified as "optical Type 2". 
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Fig. 6. — Template combinations (see §5.3) that best fit the median SEDs of the optical Type 1 and Type 2 AGN (left) and X-ray Type 1 and Type 2 AGN (right) 
in the XMM sample. From light to dark (or red to purple in the online version), the SEDs represent the following luminosity bins: log Lx < 43.0 (red), 43.0 < log 
Lx < 43.5 (yellow), 43.5 < log Lx < 44.0 (green), 44.0 < log Lx < 44.5 (blue), log Lx > 44.5 (purple). The grey dash-dotted SED is that of the starburst M82. 
MIR spectral shape is relatively independent of the X-ray or optical type, and is primarily a function of AGN luminosity, with more luminous AGN displaying 
redder MIR SEDs. The differences in the SEDs of X-ray and optically classified AGN can be explained by the selection effects and biases discussed in §5.3.1. 



In the absence of high-quality spectra, low-luminosity Type 1 
AGN (e.g., Seyfert I's) with host-dominated SEDs are there- 
fore likely to be misclassified as optical Type 2 AGN (see also 
discussion in |Brusa et al.|2Q1 0l. 

However, the vast majority (88%) of low luminosity 
(log L2-iokev(ergs s"^)< 43.5) X-ray Type 1 AGN in our XMM 
sample have robust optical spectra, yet 77% lack broad optical 
emission lines. While we cannot rule out the possibility that 
the host galaxies of these low-lumin osity AGN have diluted 
their broad emission li nes (e.g.. Page et al. 120061 ICaccianigal 
etal.|2007||Civano etal. 2007), Trump et al. (201 la) show that 
the X-ray unobscured narrow-line AGN in XMM-COSMOS 
have low accretion rates of L/L^dd < 10~^, and may therefore 
be fueled by radiatively inefficient accretion flows incapable 
of supporting a broad-line region. The X-ray Type 1, yet opti- 
cal Type 2, classification may therefore correctly represent the 
intrinsic nature of these low-luminosity AGN, whose seem- 
ingly discrepant properties give rise to many of the observed 
differences in Figure 6. 

X-ray classification is less sensitive to accretion rate and 
host galaxy properties, but is subject to a redshift-dependent 
bias. At low redshift, the soft X-ray band (0.5-2 keV) is sen- 
sitive to relatively low obscuring columns. At high redshift, 
however, low to moderate column densities become poorly 
constrained as the observed X-ray bands sample the higher 
energy rest- frame emission less subject to intrinsic obscura- 
tion. Small observational uncertainties in X-ray flux there- 
fore translate to large uncertainties in column density, and be- 
cause low columns are far more poorly constrained than high 
columns. Type 1 AGN are far more likely to be misclassified 
as Type 2 AGN than vice versa. We illustrate and quantify 
this well-known effect in more detail in the Appendix. This 
tendency to misclassify high-redshift (e.g., high-luminosity) 
X-ray Type 1 AGN as X-ray Type 2 AGN likely accounts for 
the blue UV-optical continuum of the most luminous X-ray 
Type 2 SED in Figure 6. 



6. XMM SAMPLE: TRENDS IN IRAC COLOR SR\CE 

To test for trends in IRAC color space as a function of lu- 
minosity, redshift, and AGN type, we plot in Figure 7a the 
IRAC colors of all XMM sources detected in hard X-rays. We 
then extend the power-law box defined in §4 from a = -3.0 to 
a= 1.0, enclosing 92% of the XMM sources, and plot in Fig- 
ure 7b the median luminosity, redshift, and X-ray and optical 
types of the XMM sources in bins of power-law slope. 

6.1. Trends in X-ray Luminosity and Redshift 

A clear trend is seen in X-ray luminosity, with the more lu- 
minous X-ray sources displaying redder IRAC colors, as ex- 



pected (see also 


Cardamone et al.||2008l |Brusa et al.||2009 


Eckart et al. 201 


[0\ Park et al. |2010[ Trump et al. |2011a|). 



While there is no cut in power-law slope that cleanly distin- 
guishes QSOs (log L2-iokev(ergs s~^)> 44) from less lumi- 
nous Seyfert galaxies (log L2-iokev(ergs s~^) < 44), a cut of 
a = -0.78 best separates the two populations, with equal frac- 
tions (72%) of high-luminosity AGN falling redward of the 
cut and low-luminosity AGN falling blueward of the cut. 

The median redshift also increases for sources with redder 
IRAC colors, from z~l ata=1.0toz~2ata = -2.25. 
This correlation is driven primarily by the observed trend in 
luminosity, as only the most luminous sources (which tend to 
have redder IRAC colors) can be detected out to high redshift. 
Between z=l and z = 2, the SED templates shown in Figure 
2 move roughly perpendicularly to the power-law locus, at 
least for AGN fractions < 80%. Only for templates with AGN 
fractions of 95% and moderate Av,agn ~ 2-8 do we see a 
trend towards redder IRAC colors between z = I and z = 2, 
although this shift of Aa ~ 0.25-0.4 is too weak to account 
for the observed correlation between redshift and IRAC color. 

6.2. Trends in AGN Type 

While the X-ray Type 2 fraction is essentially independent 
of a, the optical Type 2 fraction drops smoothly from a = 1 
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Fig. 7. — left panel: IRAC color-color diagram for hard-band-detected XMM sources, as a function of X-ray luminosity and optical type. The power-law box 
has been extended from a = 1.0 to -3.0. right panel: Properties of the XMM sources in each of the bins in a containing at least 10 sources (a > -2.25). The 
trends for sources with spectroscopic redshifts are shown by dash-dotted histograms (red in the online version). The filled circles and triangles give the expected 
Type 2 AGN fractions from Gilli et al., ( ^2007j and Hasinger ( 2008 >, respectively, based on the median X-ray luminosity in each bin. AGN with redder IRAC 
colors (more negative power-law indices) tend to have higher ^-ray luminosities, lie at higher redshift, and be somewhat less obscured. 



to a ~ -1.6 before rising again at the reddest IRAC colors. 
These seemingly inconsistent trends remain, and are even en- 
hanced, if we consider only those sources with spectroscopic 
redshifts (shown in red). 

Because high luminosity AGN tend to be less obscured than 
their low luminosity counterparts, at least in X-ray selected 
samples, we might expect sources with redder IRAC colors 
to have a lower Type 2 fra ction (e.g. Ueda et al. 2003 ; Gilli 
|et al.|2007[rHasinger|200"8l ). We plot in the right panel o f Fig- 
71 



ure 7 the expect ed Type 2 fractions of [Gilli et~ST 



Hasinger] ((2008) at the median X-ray luminosity of 
While a slight decrease in the Type 2 fraction would be ex- 
pected over the observed range in luminosity, the trend we ob- 
serve for the optically-classified AGN is much stronger than 
predicted. 

These observed trends, however, reflect the biases and se- 
lection effects discussed in §5.3. At blue IRAC colors, the ex- 
cess of low-luminosity, optical Type 2 AGN can be attributed 
to the population of unobscured narrow-line AGN in XMM- 
COSMOS ( |Trump et a l. 2011b). Not only are these weak 
accretors likely to lack a broad-line region, but their anoma- 
lously blue IRAC colors also suggest that they, like many low- 
luminosity, lowaccretion rate radio galaxies, may lack a hot 
dust torus (jTrump et al. 201 lb; Ogle et al. 2006). Likewise, 
the excess of X-ray Type 2 AGN at red IRAC colors can be 
attributed to the overestimation of X-ray column densities at 
high redshift (see §5.3 and the Appendix). 

Given the nature of these biases, an AGN's intrinsic obscu- 
ration is best represented by the X-ray classification at low 
luminosity/redshift, and by the optical classification at high 
luminosity /redshift. If we therefore assume the X-ray derived 
Type 2 fraction at blue slopes of a > -0.5, and the optically- 
derived Type 2 fraction at red slopes of a < -0.5, we find a 
distribution that favors unobscured AGN at red IRAC colors 



but whose evolution is not nearly as pronounced as the optical 
classification alone would suggest. This trend is likely driven 
primarily by the anti-correlation between luminosity and ob- 
scuration, and while this new hybrid estimate of the obscured 
AGN fr action may display a somewhat steeper evolution than 
those of Gilli et al.] ( |2007 l and'Hasinger| ( (2008| ), a direct com- 
parison IS complicated by the scatter in X-ray luminosity in a 
given power-law bin, our necessary exclusion of sources with 
ambiguous X-ray types, as well as the expected incomplete- 
ness of the XMM sample to the obscured AGN that are the 
target of MIR selection. 

An optically thick torus could also contribute to the ob- 
served trend by masking the hot dust signature from ob- 
scured AGN. As discussed in §2, however, the large NIR-MIR 
anisotropics predicted by models of smooth, geometrically 
and optically thick tori are not supported by recent observa- 
tions, particularly when the intrinsi c X-ray luminosity is used 
to norm alize the MIR luminosity ( |Lutz et J^|2004[ Gandhi 



let al.J2009). However, studies that normalize the AGN s lumi 
nosity by the radio continuum find a factor of a few difference 
in MIR luminosity between unobscured and obscured AGN 
( |Heckman|[T995l |Haas et al.||2QQ8| [Leipski et al.][2QTQ 



return to this issue m §9. 



We 



7. IRAC SAMPLE 

While the XMM sample sheds light on the potential com- 
pleteness of IRAC color selection to AGN of different lu- 
minosities and/or type, XMM sources comprise only 4% of 
the full sample of COSMOS IRAC sources. In the sections 
that follow, we test our ability to cleanly separate the AGN- 
dominated IRAC sources from the remainder of the primarily 
star-formation-dominated IRAC population. 
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Fig. 8. — X-ray detection fra ction for s ources with at least 50 ks of Chandra coverage, for bins containing at least 10 sources. The top and bottom panels show 
[Lacy et al. ( 2004 2007) and Stern et al.| (2005 ) color space, respectively. From left to right, the three rows show the X-ray detection fraction for all sources in 
our IKAC sample (^Ss.g/im > 11. 3 //m), and for sources with ^Ss g/im > 25/im and Ss.^^m > 50/im. The dashed line in the top panels shows the -3 < a < -0.5 
power-law box defined in §4. At the full depth of COSMOS, the X-ray detection fraction is relatively constant at ~ 50% within the power-law box, but it drops 
to < 10% across much of the remainder of IRAC color- space, including regions enclosed by the current AGN selection wedges. 



7.1. X-ray Detection Fraction 

We plot in Figure 8 the Chandra X-ray detect ion fraction 
(down to a limiting flux of 5.7 x 10^^ ergs s~\ |Elvis et aL 
||009 ) for the 1 1324 IRAC sources with at least 50 ks ofChan- 
dra coverage, for which the median (mean) X-ray exposure is 
148 ks (128 ks). To maximize the resolution in IRAC color 
space, we divide the IRAC sample into grids containing at 
least 10 sources. The typical grid size is therefore inversely 
proportional to the density of IRAC sources in a given region 
of color space. While the Chandra dataset only covers the 
central 0.9 deg^ of the COSMOS field, its increased depth and 
higher resolution than the larger-area XMM data allow us to 
probe more deeply the X-ray properties of the IRAC sample. 

At the full depth of the COSMOS IRAC survey, 42% of the 
sources in the a < -0.5 power-law box are detected by Chan- 
dra, compared to only 5% of the sources outside the power- 
law box (see Figure 8). Within the power-law box, the X- 
ray detection fraction remains relatively constant down to a 
slope of a ~ -0.75. It drops somewhat, however, at the bluest 
extremes of the power-law box where we might expect con- 
tamination from high-redshift galaxies (see Figures 2, 3, and 
4). An additional cut in 8.0 to 4.5 jim color may therefore be 
warranted, and will be explored in more detail in §8. 

It is also clear from Figure 8 that at the moderate depth 
of the COSMOS IRAC data, s ignificant fractions of the cur- 
rent AGN selectio n regions of |Lacy et aT| p004| [2007 ) and 
Stern et aL] ( |2005| ) are characterized by low X-ray detection 



fractions indistinguishable from the typical values outside of 
the selection wedges. These regions of color- space coincide 
with the regions where we expect contamination from low and 
high-redshift star- forming galaxies, and further motivate the 
need for new AGN selection criteria. 

To illustrate the effects of IRAC depth, we also plot in Fig- 
ure 8 the X-ray detection fractions for limiting flux densities 
of 25 and 50 /iJy in the 5.8 jim band (which effectively limits 
the detection of red, powe r-law like sources) . F or compari- 
son, th e IRAC data used by |Lacy et"ar] ( |2004| ) and |Stern et al. 
( |2005| ) to define the current AGN selection regions have lim- 
iting 5 .8 jim depths of 60 /iJy and 76 /iJy, respectively. As the 
IRAC data become progressively shallower, the X-ray detec- 
tion fractions rise (to 64% and 77% in the power-law box at 
limiting flux densities of 25 and 5 /iJy, respectiv ely) and the 
number of grids in the |Lacy et al.| ([2Q04 , 2007]) and |Stern et al. 
p005| ) AGN selection regions with low, star-formation-like 
X-ray detection fractions falls. This trend reflects the increase 
in the AGN fraction with increasing MIR flux density (e.g., 



Brand et al. 2006 Treister et 



aL] [2006l [Donley et al.][2QQ8] ). 
iination by star-forming salax- 



While the likelihood of contamination by star-torming ga 
ies therefore decreases at higher IRAC flux densities, so too 
does the number of X-ray non-detected AGN candidates in 
the power-law box, from 570 at the full IRAC depth to 182 
and 67 at limiting 5.8/im flux densities of 25 and 50 /iJy, re- 
spectively. 
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Fig. 9. — Hardness ratios (HR = (H-S)/(H + S)) of the stacked emission from X-ray non-detected (left) and X-ray detected (right) IRAC sources wit h at least 
50 ks of Chandra coverage, where H and S represent the 2-8 keV and 0.5-2 keV ACIS count rates, respectively. The top and bottom panels show Lacy et alT] 
f2004 2007 1 and Stern et al. ( 2005 1 color space, respectively. The minimum number of sources per grid is 200 for the X-ray non-detected sources, and 25 tor 
the X-ray detected sources. White (red in the online version) and black boxes indicate those grids with a detection in only the soft or hard bands, respectively. 
Grids detected in neither band are shaded with hashed lines. The dashed line in the top panels shows the power-law box defined in §4. The hardest X-ray signal, 
consistent with emission from obscured to mildly Compton-thick AGN, comes from sources in the power-law box. 



7.2. X-ray Stacking 

While the vast majority of IRAC sources lack X-ray coun- 
terparts, X-ray stacking can shed light on the nature of the X- 
ray non-detected population. Using vS.Obeta of the program 
CSTACK0 

we stack the X-ray emission for grids in IRAC 
color space containing at least 200 X-ray undetected sources. 
We exclude all sources that lie more than 8' from the Chan- 
dra aim-point, and extract source counts within the smaller of 
either the position-dependent 90% encircled-energy radius or 
the 5'^ inner background subtraction radius. We then measure 
the background across the remaining 15''xl5'' image, after 
masking all known X-ray sources. 
We plot in Figure 9 the X-ray hardness ratios, HR, for grids 

http://cstack.ucsd.edu/cstack/ 



detected to > 3 a in the soft and/or hard X-ray bands, and 
plot for comparison the stacked hardness ratios of the X-ray 
detected sources (HR = (H -S)/(H -\-S), where H and S rep- 
resent the 2-8 keV and 0.5-2 keV ACIS count rates, respec- 
tively). While low source densities limit the resolution along 
the power-law locus, we detect hard X-ray emission from all 
red (a < 0) power-law grids in |Lacy et al.| ([2004, 200^ color 
space. In contrast, the majority of grids outside of the power- 
law box are detected only in soft X-rays, and those with mea- 
surable hard X-ray emission lie in regions where we expect to 
find lower-luminosity AGN whose hosts dominate their MIR 
SEDs. 

The hardest X-ray signal is observed for sources in the 
a < -1.0 power-law box: HR= 0.31 ±0.13, where the er- 
rors have been derived from the CSTACK bootstrap analy- 
sis. For comparison, the stacked emission from X-ray de- 
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tected sources in this region of color space is significantly 
softer: HR= -0.31 ±0.13. At the typical photometric red- 
shifts of the X-ray and non-X-ray sources, z= 1.8 ±0.7 
and z = 2.1 ± 1.0, respectively, these hardness ratios corre- 
spond to column densities of log A^H(cm~^)= 22.4 ± 0.4 and 
log A^H(cm-2)= 23.5 ±0.4. 

While 61% of the X-ray sources have spectroscopic red- 
shifts and the remainder have AGN- specific photometric red- 
shifts, only 5% of the X-ray non-detected sources in this re- 
gion of color space have spectroscopic redshifts and their 
photometric reds hifts have been cal culated using only normal 
galaxy templates ( [Ilbert et al.|2009| ). To place an independent 
constraint on the typical redshift of the X-ray non-detected 
sources in the a < -1 .0 power-law box, we turn to their IRAC 
fluxes, which are on average 3. 1 ± 0.3 times fainter than those 
of the X-ray sample. If we assume that this fairly uniform 
dimming across the four IRAC bands can be attributed solely 
to redshift effects (i.e., that the X-ray and non-X-ray power- 
law samples have similar intrinsic MIR luminosities), we esti- 
mate a typical redshift of z = 2.7 and a typical column density 
of log A^H(cm~^)= 23.7 for the X-ray non-detected sources, 
slightly higher than the values quoted above. 

7.3. X-ray/MIR Relation 

The observed X-ray to MIR luminosity ratio, a proxy for 
AGN obscuration, allows us to place an independent con- 
straint o n the column densities of the a < -1.0 power-law 
sources (|Lutz et al.|2004||Maiolino et al.|2007t [Gandhi et al.| 



2009} [Park et al.||2010| ). For the X-ray detected and non- 
detected AGN, we approximate the median 6.7 jim rest- frame 
luminosity from the median power-law slope and observed 
8.0 fim IRAC flux density, and we calculate the rest-frame 2- 
10 keV luminosity from the stacked 0.5-2 keV (1.5-6 keV at 
2) X-ray flux, assuming F = 1.4. For the subsample of 
sources with MIPS 24 jim (rest- frame 6.5 jim) counterparts, 
this extrapolated 6.7 jim rest- frame flux density is in excellent 

agreement with the median observed 24 jim flux density^ 

Using the X-ray/MIR relation of IMaiolino et aL] ( |2007| ), 
we measure a MIR-derived intrinsic X-ray luminosity of 
log L2-iokeV (ergs s~^)~ 43.8 for the individually X-ray non- 
detected a < -1 .0 power-law galaxies, ~ 20 times higher than 
observed in the stacked signal. To achieve this degree of hard 
X-ray suppression, a marginally Compton-thick column den- 
sity of log A^H(cm~^)~ 24 is required, generally consistent 
with the HR-derived values above. In comparison, we see no 
significant offset between the observed and MIR-derived hard 
X-ray luminosities of the X-ray detected sources in this region 
of color space (log L2-iokev(ergs s~^)~ 44.1), as expected for 
sources with low to moderate (log A^H(cm~^)< 23) obscuring 
columns. 

8. HIGH REDSHIFT GALAXIES 

The IRAC colors of purely star-forming galaxies generally 
avoid the power-law region of color space, at least out to z < 3 
(see Figure 2). At z > 3, however, only one IRAC channel 
samples the red side of the 1.6 jim stellar bump, and by z 5, 
the IRAC bands fall entirely on the blue side of this feature. 
In high-redshift galaxies with older stellar populations and/or 
measurable dust extinction, this stellar bump has the potential 
to mimic the power-law emission from AGN. To determine 
whether high-redshift star-forming galaxies are an important 
source of contamination, we investigate below the IRAC col- 
ors of high-z sources in COSMOS and in the deeper GOODS 



fields. 

While we could attempt to identify high-redshift candidates 
purely on the basis of their photometric redshifts, we are pri- 
marily interested in sources whose stellar bumps fall redward 
of the IRAC bands and whose fluxes are faint, precisely those 
sources for which photometric redshift fitting is most diffi- 
cult. To constrain the likelihood of contamination by high-z 
star-forming galaxies, we therefore turn to spectroscopically- 
confirmed and color-selected high-z samples: BzK-selected 
galaxies (jPaddi et al.||2 004), distant red galaxies (DRGs, 
Franx et al.| |2003Hvan Dokkum et al. 2003), Lyman-break 
galaxies (LBGs, e.g., Steidel et al. 2003), z > 3 evolved and/or 
reddened galaxy candidates, a nd sub-mm galaxies (SMGs) . 
In the IRAC-selected sample of |Perez-Gonzalez et al.| ( |2008 ) 
drawn from the HDF-N, CDF-S, and Lockman Hole fields, 
90% of the IRAC sources at z = 3 - 3.5 meet at least one of 
the BzK, DRG, or LEG criteria, as do ~ 80% of the sources 
at z = 3.5 -4. 

8.1. BzK Galaxies 

The BzK selection technique of |Daddi et al.| (|2004|) was 
designed to identify star-forming (sBzK) and passive (pBzK) 
galaxies at 1.4 < z < 2.5, although it also identifies luminous 
AGN generally removed from BzK-selected samples on the 
basi s of the ir hard X-r ay or MIR power-l aw emission (see. 



e.g., [Sharp et al. 2002} 



Daddi 



VLIK pc 

et al |2 



2007a). To constrain the 



IRAC colors of BzK-selected galaxies in COSMOS, we adopt 
the COSMOS BzK catalog of [McCracken et al.| ( |20T0l ). 

We plot in Figures 10a and 11a the IRAC colors and redshift 
distributions of the 14% of sBzK galaxies that meet our IRAC 
detection criteria. Based on the IRAC color distribution and 
the relative density of X-ray sources, we divide the IRAC- 
detected sBzK galaxies into two populations. The dominant 
population (7 9%) occupies the s ame region of IRAC color 
space used by 'Huan g et al.| ( [2009] ) to identify 1.5 < z < 3 star- 
forming galaxies: log(8 . 0/im/4 . 5 /im) < 0.15 (see Figure 4). 
Only 11% of these relatively blue > 50 ks sources have a 
Chandra counterpart. 

The population of sBzK sources with red IRAC col- 
ors (log(8.0/im/4.5/im) > 0.15) extends upwards along the 
power-law locus and has an X-ray detection fraction of 49%, 
higher even than the median value in the power-law box (see 
§7.1). While these red sources are therefore strong AGN can- 
didates. Figures 4 and 10 indicate that an additional color cut 
of log(8.0/im/4.5/im) > 0.15 should be applied to the AGN 
selection criteria to prevent contamination from high-redshift 
star-forming galaxies. This cut, shown in Figure 10, rejects 
sources in the lower left comer of the p ower-law box, and is 
slightly bluer than the cut proposed by Coppin et al. (2010) 
to distinguish AGN and star-form ation-d ominated SMGs, 
log(8.0/im/4.5/im) > 0.22 (see also |Yunet al. 2008 ). 

We plot in Figures 10b and 1 lb the IRAC colors of the 31% 
of pBzK galaxies detected to > 5cr in the four IRAC bands. 
Nearly all (97%) of the pBzK galaxies have IRAC colors blue- 
ward of the cut defined above, and only 4% of these are X-ray 
sources. Of the few sources with red IRAC colors, 50% are 
again X-ray detected. 

8.2. DRGs 

The distant red galaxy (DRG) criterion, (/-^)ab > 1.37 or 
(/-^)vega > 2.3, was designed to select z = 2-3.5 galaxies 
with a strong B aimer or 4000A break and identifies 77% of 
massive galaxies at z = 2-3 (van Dokkum et al.,2006j . Using 
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Fig. 10. — IRAC colors of high-z galaxy populations with at least 50 ks of Chandra coverage, in|Lacy et al.|( |2004 ] |2007 ) color space. Black circles (red circles 
in the online version) indicate sources with Chandra counterparts, and the inset histograms give the redshift distributions of the high-z populations. The dotted 
line shows the log(8.0/im/4.5/im) < 0.15 cut defined in §8 to exclude high-redshift star-forming galaxies from the power-law box (dashed line). In panel (d), 
only the three sources indicated by large circles are bright enough to meet the COSMOS 5cr limits in all four IRAC bands. After applying the 8.0 /im/4.5 /im cut 
indicated by the dotted line, we therefore expect minimal contamination from high-redshift star-forming galaxies. 



the UKIRT / and CFHT K, data from |Capak et al.| ( [2QTT 
andl McCracken et al.| ( [2QT0 |, we identify - 5000 DRGs m 
COSMOS, 25% of which lie above the > 5a sensitivity hmits 
in each of the IRAC bands. Their IRAC colors and redshift 
distribution are shown in Figures 10c and 11c. 

While the DRG color cut was originally intended to select 
sources with a strong B aimer break at z 2-3.5, it al so re- 
c overs dusty star-forming gal axies and AGN at z < 1.4 ( [Con- 
lice et al.|2007t|Lane et al.j2007) ). For clarity, we therefore 
divide the DRG sample in Figures 10c and 11c into low and 
high redshift components, adopting a cut near the minimum 



in our redshift distribution, z = 1.6. 

The DRGs, like the BzK- selected galaxies, can be divided 
into two main populations using a cut of log(8.0/im/4.5/im) = 
0.15. The bluer sources, which comprise 91% of the popu- 
lation, have an X-ray detection fraction of only 6% and can 
again be excluded from the AGN selection region using the 
cut defined above. The red sources extend along the power- 
law locus and have a far higher X-ray detection fraction of 
53% (or 56% if we consider only the red sources that lie in the 
power-law box), indicative of AGN e mission. In the deeper 
GOODS-S field, jPapovich et al.| ( [2Q06| find X-ray or MIR ev- 
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Fig. 1 1 . — Same as Figure 10, but for |Stern et al.H2005) color space. 

idence of AGN emission in 25% of DRGs. In COSMOS, if 
we take as AGN all X-ray detected DRGs as well as all red 
(log(8.0/im/4.5/im) > 0.15) X-ray non-detected DRGs in the 
power-law box, we conclude that at least 14% of the IRAC- 
detected DRGs are AGN. 

8.3. LBGs 

The Lyman-break dropout technique remains the most suc- 
cessful method of identifying galaxies at z ^ 3-1. Of the 
photometrically- selected U, B, g, V, R, and i dropout can- 
didates in COSMOS, however, fewer than 2% are detected 
at or above the 5a sensitivity limits in all four IRAC bands. 
As these bright sources are also those most lik ely to be low- 
redshift interlopers (see, e.g., |Reddy et al.| 2()08 ), we limit our 
analysis of LBGs to sources that have been spectroscopically- 
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confirmed. 

Only three spectroscopically-confirmed star-forming z > 3 
LBGs in COSMOS are detected to > 5cr in all four IRAC 
bands, and two of these are in a close pair with blended IRAC 
photometry. We therefore turn to the ultra-deep GOODS 
fields and adopt the sam ple of 74 spectrosco pically-confirmed 
z = 3.8-5.0 LBGs from Shi m eFaL] ( [20TT] ), selected to have 
> 5(7 detections in the 3.6 and 4.5 jim IRAC bands. We plot 
in Figures lOd and lid the IRAC colors of the 27 LB Gs de- 
tected to > 3(7 in all four IRAC bands. As none of thelShimJ 
[erall ( [2QTT| GOODS sources meet the COSMOS 5a sensitiv- 
ity cuts in all in four IRAC bands, we do not expect LBGs to 
be a significant source of contamination at the depth of COS- 
MOS. Nonetheless, their IRAC properties shed light on the 
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potential contamination by high-redshift star-forming galax- 
ies in surveys with deeper IRAC data. 

Despite their median redshift of z = 4.2, the LBGs have 
relatively blue IRAC colors. Of the 27 LBGs, only six 
fall within the power-law box, and half of these are X-ray- 
detected AGN. This behavior can be attributed to the require- 
ment that LBGs have a bright UV continuum (which in turn 
biases the selection towards galaxies with young stellar pop- 
ulations, low extinctions, and thus blue UV-optical rest- frame 
colors) and to the bright Ha emission observed in 70% of the 
Shim et aL| ( |2Qll| ) sources, which leads to an excess of 3.6 /im 
emission relative to the best-fit stellar continu um. If no Ha 
were present, however, t he, Shim et al .' ('2011') sources would 
simply shift to the right in |Lacy et al.| (2004 , 2007 ) color space 
and continue to be excluded by the 8.0/im/4.5/im cut defined 
above. For a high-redshift galaxy to display red colors in the 
observed IRAC bands, it must be more heavily reddened or 
have an older stellar population than the typical LBG. 

8.4. Evolved and Reddened z > 3 Candidates 
Several attempts have been made to select reddened and/or 



evolved galaxies at z > 3 ( 


Yan et al.||2006lJRodighiero et al. 


20071 IWiklind et al.||2008 


IMancini et al. 120091 IMarchesini 


et al.||2010|). The high 24 /xm detection fraction for these 



forming galaxies at z = 2 - 3 or to a dominant obscured AGN 



component at high red shift (see, e.g., [Dunlop et al.| 2007 [ 
Marchesini et al.|2()T0 ). We therefore plot m Figures lOd and 
lid only the "no-MlPS" z > 3.5 IRAC-selected and z > 5 
B aimer-bre ak-selec ted high-z candidates fro m the G OODS 
samples of [Mancini et al. (2009) ^d Wiklind et al.|([2008), 
and further restrict the [Mancini et al. ( 2009 ) sample to those 
sources lacking a z < 3 redshift solution within the 90% con- 
fidence interval. 

None of the high-z evolved and/or reddened galaxy can- 
didates plotted in Figures lOd and lid meet the COSMOS 
5(7 sensitivity limits in all four IRAC bands, so we again ex- 
pect little to no contamination from such galaxies in COS- 
MOS. Furtherm ore, only 5 of the 13 z > 3.5 candidates from 
Mancini et al.| (j2009^ lie within the power-law box , primar- 
ily at colors blueward of a ^ -1.0. (The reddest [Mancini 
et al. ( 2009) source in the power-law box is the Zphot = 3.93 
SMG Gm200.5/AzGN06.) Likewise, two of the three z > 5 
B aimer-break candidates of Wi klind et al.| ( |2008] ) fall in the 
power-law box, though again at a > -1.0. The relatively blue 
continua of these galaxies, compared to the redshifted tracks 
of local templates, can likely be attributed to the bluer UV 
continua and lower dus t extinctions of high redshift galax- 
ies (Bou wens eraD '20Q9). Even in the deep GOODS field, 
we therefore expect only minimal contamination from typical 
high-redshift galaxies, and only in the bluest regions of the 
power-law box. 

8.5. SMGs 

While dust extinction appears to be less prevalent in the 
early universe, heavily dust obscured luminous star-forming 
galaxies have been selected as sub-mm galaxies (SMGs) out 
to z 5 ( Capak et al. 2008 ; Schinnerer et al. 20 08 ; Daddi et al. 
2009b a; Coppin et al. 2009; Knuds en et al.p OlO; Riechers 
et al.. .2010, .Smolcic et al., 2011). At moderate redshifts of 



coincident with the BzK and DRG star- forming populations 
shown in Figures 10a, 11a, 10b, and lib (see also Figure 4). 
At z > 4, however, UV-NIR color cuts are unlikely to identify 
the dust-obscured and IR-luminous SMGs mos t likely to en ter 
the AGN selection region (though see Manc ini^t al.|2009| ). 
In COSMOS, six spectroscopically-confirmed z > 4 SMGs 

hav e been identified over 0.2 deg^. Of these, only AzTEC- 
5 (Marches ini et al.||2QTQ , Capak et al. 2011, in prep.) is 
detected to > 5(j in all four IRAC bands, although AzTECl 
( [Smolcic et al.|201 \\ falls just below our cuts. We plot in Fig- 
ures lOd and l id the IRAC colors of these two high-z SMGs, 
as well as the four (of seven) additional spectroscopically- 
confirmed z > 4 SMGs with high-a IRAC counterparts: the 
GOODS-N sources GN20 and GN 20.1 (D addi et al. 2009bi 
the EGS sou rce LESS J033229.4 ([Coppin et al. 2009; GiUil 
^raLl[20Tl1), and the Abell 2218 source SMM 11635553^ 
( [Knudsen et al.|2010| ). 

All of the z > 4 SMGs fall in the power-law box at -1 < 
a < -2, although only three are bright enough to be included 
(or nearly included) in our COSMOS IRAC sample. At least 
half of the z > 4 SMG s, however, are kno wn AGN (GN20.2 
is a radio-bright AGN ( [Daddi et aH2QQ9bl ), LESS J033229.4 
is a Compton-thi ck AGN detected m the 4Ms Chandra data 
( [Gim et al.|2QTT ), and the IRS spectrum of GN20 indicates a 
considerable AGN contribution to the rest-frame 6 jim emis- 
sion (D. Riechers et al. 2011, in prep.)), and all but one have 
a bright MIPS counterpart that points to a likely AGN contri- 
bution. We therefore expect little to no contamination from 
predominantly star-forming SMGs at moderate to high red- 
shifts. 

9. REVISED IRAC CRITERIA 

We take as a starting point for the revised IRAC selection 
criteria the a < -0.5 power-law box defined in §4. Not only 
does this box enclose the vast majorit y of sources that would 
be identified as power-law galaxies ([Alonso-Herrero et al. 



2006; Donley et al. [20071 IPark al.|2010p , but it also tightly 
encloses the templates of AGN -dominated sources (see Figure 
2) and the region of color space with a high (~ 50%) X-ray 
detection fraction (see Figure 8). 

To prevent contamination from high-redshift {z > 2) galax- 
ies bright enough to be included in the COSMOS IRAC sam- 
ple, we then impose a cut of log(8.0/im/4.5/im) > 0.15, de- 
fined in §8. While this high-z cut removes the majority of 
grids in the power-law box with low X-ray detection fractions 
(see Figure 8), the leftmost corner of the power-law box also 
has a lower than average X-ray detection fraction, presumably 
due to contamination by low-redshift star- forming galaxies. 
To exclude these sources, we impose a vertical cut that coin- 
cides with the intersection of the a < -0.5 power-law box and 
the high-z cut discussed above: log(5.8/im/3.6/im) > 0.08. 

We plot in Figure 12 the IRAC colors of the sources that 
meet these criteria. While all of the new AGN candidates 



1.4 < z < 2.6, more than 80% of bright radio- selected SMGs 
meet one or m ore of the B zK, DRG, or LBG (e.g, BM/BX) 
criteria (Reddy et al.|2005[ ). Their IRAC colors are therefore 



fall by definition within the original Lacy et al. (2004 2007 ) 
AGN selection wedge, 9% have IRAC SEDs that do not rise 
monotonically, but that decrease between 3.6 and 4.5 /im 
or 5.8 and 8.0 /im (or, in rare cases, bet ween 4.5 and 5.8 
/im), placing them on the outskirts of the Stern et al. (2005) 
AGN s election reg i on (of these, 38% formally fall outside 
of the Stem et al.[ ( [2005] ) AGN wedge). This behavior is 
rare among the XMM AGN in our selection region, 97% of 
which have monotonically rising IRAC SEDs (a fraction that 
rises to 99.6% if we consider only luminous QSOs with log 
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Fig. 12. — New AGN selection criteria (thick solid lines). The full COSMOS IRAC sample with 7i > 50 ks are plotted as small points, and sources that meet 
our new criteria are shown by large filled circles. Sources plotted in black (or red/magenta in the online version) have X-ray counterparts, and sources plotted in 
grey (or blue/cyan in the online version) do not. Large (yellow) stars represent sources that lie within the revised wedge in Lacy et al. ( 2004 2007 1 color space 
but that are excluded from our final selection because their IRAC SEDs do not rise monotonically. The (purple) squares and (green) triangles represent the X-ray 
and non-X-ray sources, respectively, that fall well within the Stern et al. (2005) AGN selection region but that are excluded by our new AGN selection criteria 
(see §9.2.3). " " 



^2-iokev(ergs s~^)> 44), and may be due to the 1.6 fim stel- 
lar bump passing through the IRAC bandpasses. This inter- 
pretation is consistent with the median redshifts, z = 0.7 and 
z = 2.2, and low > 50 ks X-ray detection fractions, 17% and 
30%, of the sources with blue 3.6/4.5 /im and 5.8/8.0 /im col- 
ors, respectively. We therefore exclude from our final selec- 
tion criteria all sources with non-monotonically-rising IRAC 
SEDs, plotted as yellow stars in Figure 12. Of the low re dshift 
sources prev iously excluded by the vertical cut in Lacy et al. 
( |2004[[2QQ7] ) color space, 81% would also be excluded by this 
criterion. 

The final revised AGN selection criteria are as follows, 
where A is the logical 'AND" operator: 



^='-^^o{t^)^ y^i-g^ofe) (1) 

x>0.08 A J > 0.15 (2) 
A 3;>(1.21 xx)-0.27 A j < (1.21 x x) + 0.27 

A /4.5/im > f3.6fim A fs.^jj^m > /4.5/im A fs.Ofim > fs.^fim 

These new criteria identify 1506 AGN candidates in COS- 
MOS, only 38% of which have XMM or Chandra counter- 
parts. In regions of deep Chandra coverage (T^ = 50- 160 ks), 
the X-ray detection fraction is 52%. 

Repeating the X-ray stacking analysis for the final AGN 
candidate sample confirms the results of §7.2. The X-ray de- 
tected AGN that meet our IRAC criteria lie at z = 1 .7 ± 0.8 and 
have typical stacked hardness ratios and column densities of 
HR= -0.30 ± 0. 1 1 and log Nu(cm-^)= 22 A ± 0.4. In compar- 
ison, the X-ray non-detected AGN candidates with Tx > 50 ks 
(median Tx = 125 ks) have slightly higher photometric red- 
shifts: Zphot = 2.2 ±0.9. While this population lacks individual 
X-ray counterparts. X-ray stacking leads to ~ 6a detections in 



both the soft and hard X-ray bands with HR= 0.29 ± 0.13 or 
log A^H(cm~^)= 23.5 ± 0.4. IRAC selection therefore appears 
to successfully recover large samples of both unobscured to 
moderately obscured X-ray-detected AGN, as well as heavily 
obscured, high-redshift AGN missed by deep X-ray surveys. 

9.1. Reliability 

The criteria defined above have been designed to reject the 
majority of l ow- and high -redshift star-forming contaminants 
that enter the |LacyetaD( 2004, 2007) and Stern et al. (2005| 
AGN selections wedges (see Sections 2, 7, 8, and 9). While 
approximately half of the AGN candidates selected by these 
criteria are not individually detected in deep (T^ = 50-160 ks) 
Chandra data, their stacked X-rays are consistent with emis- 
sion from heavily obscured AGN. However, 32% of the > 
50 ks X-ray undetected IR AGN candidates have Zphot ^2.7, 
the redshift at which the templa tes of purely star-fo rming lo- 



cal LIRGS and ULIRGS from |Rieke et al.| ( |2009l ) begin to 
enter our new selection region. To determine if these sources 
are star- forming contaminants, we repeat the X-ray stacking 
analysis for three equally-populated redshift bins: z = - 1.8, 
z= 1. 8-2.7, andz = 2.7-4.8. 

The stacked emission from the z > 2.7 IR AGN candi- 
dates is detected to 3.9cr in the 0.5-2.0 keV band and to 
4.0(7 in the 2-8 keV band with an observed hardness ratio of 
HR = 0.16 ± 0.18 (or log Nn(cm-^)= 23.6 ± 0.2 at the mean 
redshift of z = 3.2). As the effective X-ray photon index, 
Feff = 0.67:^0 3^, is harder than even the hardes t star- forming 

a significant 
^ For com- 



galaxy (Feff =1-2, e.g., Lehmer et al. 2008] 



fraction of these sources must be obscured A 



We use v4.2 of the Portable, Interactive Multi-Mission Simulator 
(PIMMS) to estimate Fe ff using the Galactic column density of 2.7 x 
10^*^ cm~^ jElvis et aT 2009) and the Chandra Cycle 8 effective area curves. 
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Fig. 13. — Left: Fraction of hard X-ray and IRAC-detected XMM and Chandra sources that meet the new IRAC cuts, as a function of intrinsic 2-10 keV X-ray 
luminosity. While few low-luminosity AGN meet our criteria, the IRAC selection criteria recovers 75% of XMM QSOs and 69% of Chandra QSOs with log 
^2-i0kev(ergs s~^)> 44. Right: Best fits to the median SEDs of QSO-luminosity XMM sources that do or do not meet our selection criteria. Both the Type 1 and 
Type 2 QSOs missed by the IRAC criteria have slightly redder UV-optical continua and prominent 1.6 /im stellar bumps, indicating that IRAC selection is most 
likely to miss AGN with luminous hosts, particularly when the AGN emission is itself obscured. 



parison, the low redshift bin is detected to 2Acr in the soft 
band and to 4.1a in the hard band with HR = 0.50 ± 0.23 or 
Teff = -0.04!g-p (log Nn(cm-^)= 23.2 ± 0.4 at z = 1.2), and the 
medium redshift bin is detected to 3.2cr in the soft band and to 
2.1(1 in the hard band with HR = 0.26± 0.27 or Teff = 0.49!^ 
(log Nn(cm-^)= 23.5 ± 0.3 at z = 2.2). 

Many of the high-z AGN candidates, however, show a clear 
infrared excess relative to even the reddest star-forming tem- 
plate. To better isolate the sources most likely to be star- 
forming contaminants, we develop the following empirical 
criteria to identify high-z s ources wh ose IRAC photometry 
can plausibly be fit by the |Rieke et al.| ( j2009| ) star-forming 
templates: 




'0.39XZ-0.69 if z = 2.7-3.1 
0.18XZ-0.04 ifz = 3. 1-4.2 
0.06XZ+0.47 ifz = 4.2-5.0 



and > 0.95 

/8.0/nm/ 74.5 Aim ~ 



(3) 



(4) 



The first criterion identifies sources whose 8.0 to 3.6 fim flux 
ratio is not significantly redder than the reddest |Rieke et aL 
(2009) LIRG/ULIRG template at a given redshift, and the sec- 
ond criteria ensures that the curvature of the IRAC photome- 
try is roughly consistent with the redshifted 1.6 /im stellar 
bump. Approximately 50% of the z > 2.7 X-ray non-detected 
IR AGN candidates (or 15% of the full sample of X-ray non- 
detected AGN) meet these criteria and therefore have IRAC 
photometry potentially consistent with star- formation at their 
assumed redshift (though nearly all show an infrared excess 
relative to the bluest of the Rieke et al. ( 2009) templates). X- 
ray stacking of this subsample, however, again returns signif- 
icant (> 3cr) detections in both the soft and hard X-ray bands 
with HR = 0.33 ± 0.22 or Teff = 0.35!^^^^ indicating that any 
contamination by high redshift star-forming galaxies is min- 
imal. Instead, these sources appear to be either high redshift 



AGN whose underlying hosts are bluer than the reddest [Rieke] 
[et al.| ([2009) template, or lower redshift AGN whose photo- 
metric redshifts have been artificially inflated by the fact that 
they were estimated using only star-forming templates, which 
are consistent with red, power-law like IRAC emission only 
at high-z. Interestingly, we do not detect a significant stacked 
signal for the 50% of z > 2.7 AGN candidates with extremely 
red IRAC colors, perhaps because these sources lie at gen- 
uinely higher redshifts or are more heavily obscured. 

9.2. Completeness 

To quantify the completeness of the new selection crite- 
ria, we plot in Figure 13a the fraction of hard X-ray and 
IRAC-detected XMM and Chandra sources that fall in our 
new selection region as a function of their intrinsic 2-10 keV 
luminosities. The completeness of our new selection crite- 
ria is a strong function of AGN luminosity, as expected. At 
log Lx(ergs s~^)< 43, fewer than 20% of AGN meet our crite- 
ria due to dilution of the AGN continuum by the host galaxy 
and/or to the disappear ance of the hot dust torus at low Ed- 
dington accretion rate ( [Trump et al.|[20jJa| ). At QSO lumi- 
nosities of log Lx(ergs s~0^ 44, however, 75% of the XMM 
AGN and 68% of the Chandra AGN fall in the revised IRAC 
selection region. 

9.2.1. Nature of the QSOs missed by our selection criteria 

We plot in Figure 13b the best fits to the median SEDs 
of the XMM QSOs (e.g., log Lx(ergs ^-^)> 44) that fafl in- 
side and outside of our selection region, calculated as de- 
scribed in §5.3. The XMM-selected QSOs that meet our cri- 
teria have a median redshift of z= 1.8, a median luminos- 
ity of log Lx(ergs s~^)= 44.4, a median column density of 
log A^H(cm~^)= 22.3 (for the 82% of sources with a measur- 
able column), and X-ray and optical Type 2 fractions of 53% 
and 28%, respectively. The Type 1 and Type 2 SEDs are re- 
markably similar at A > 1 /im, and they show no sign of host 
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galaxy emission. 

In comparison, the XMM-selected QSOs (again defined to 
have log Lx(ergs s~^)> 44) that fall outside of our selection 
region have a median redshift of z= 1.6, a median lumi- 
nosity of log Lx(ergs s~^)= 44.2, a median column density 
of log A^H(cm~^)= 22.6 (for the 88% of sources with a mea- 
surable column), and X-ray and optical Type 2 fractions of 
74% and 54%, respectively. Their SEDs are characterized by 
slightly redder UV-optical continua and by noticeable 1.6 /im 
stellar bumps, even in the case of the Type 1 QSOs. The QSOs 
missed by IRAC selection therefore appear to be somewhat 
more heavily obscured, lower-luminosity AGN whose host 
galaxies contribute a larger relative fraction of their optical- 
NIR flux. 

9.2.2. Completeness to Heavily Obscured AGN 

To better constrain our completeness to heavily obscured 
AGN, we turn to th e luminous (/24 > 700 /iJy) dust-obscured 
galaxies (DOGs) of |Donley et al.|(|2010I), selected on the ba- 
sis of their high 24 jim to R-band flux ratios. All but one of 
these AGN-dominated sources at z ~ 2 have QSO-like lumi- 
nosities (e.g., log Lx(ergs s~^)> 44), and as many as 80% may 
be Compton-thick. Eight of the 11 QSO-luminosity DOGs 
with good IRAC data meet our new IRAC criteria, with one 
additional source falling just beyond our selection region. 

The two DOGs that fall well outside of our selection re- 
gion (IRBGIO and IRBG13) are the brightest radio sources 
in the sample, and both show a distinct curvature in their 
IRAC SEDs, with an excess of 3.6 and 8.0 /im emission 
and/or a deficit of 4.5 an d 5.8 /im emission (see Figure 



1 o f IDonley et al ||20T0| ). |Haas et al.| (|2008|) and Leip^ 
[ski et al.| ( |2010| ) observe similar SEDs among their sam- 
ple of z > 1 178 MHz- selected double-lobed radio galaxies 
(e.g., radio- selected Type 2 AGN), which they attribute to the 
combined effects of AGN obscuration and host-galaxy emis- 
sion. It is perhaps not surprising, then, that while the new 
IRAC sele ction criteria recover all of the Type 1 3CRR radio 
quasars of |Haas et al.| 12008), they recover only 33% of the 
z = 1.3 =b 0.2 narrow-line (e.g.. Type 2) 3CRR radio galaxies, 
whose low-frequency selection and complete optical identifi- 
cation should result in an obscuration-independent sample of 
luminous radio-loud AG>f^ 
To extend this test to higher redshift , we adopt the Spitzer 



high-z radio g alaxy (SH zRG) sample of [Seymour et al.|p007 



and De B reuck et al.[ ([2010). Of the z = 2.2 ± 0.8 Type 2 
SHzRGs with good IRAC photometry (approximately half of 
the full SHzRG sample), 61% He within our AGN selection 
region. The AGN that are missed tend to have the highest star- 
forming contributions to their 1.6 /im rest- frame flux and the 
lowest radio core fractions, indicating that they are preferen- 
tially viewed in edge-on, potentially heavily obscured orien- 
tations. While the SHzRG and z> I 3CRR radio galaxy sam- 
ples lie at different redshifts, they have similar rest- frame ra- 
dio luminosities. The higher recovery fraction of the SHzRGs 
is therefore likely due to the more heterogeneous selection of 
this sample, whose larger range of core fractions (e.g. orien- 
tations) relative to the 3CRR sample suggests a bias towards 
less heavily obscured radio galaxies. 
The X-ray stacking results from Sections 7 and 9 and the 

The z > 1 3CRR radio galaxies discussed here lie at higher redshifts and 
have higher radio luminosities {uLi,(ll^ MHz, rest)> 10^"^ ergs s~^) than the 
intrinsically MIR-weak and LINER- like narrow-line radio galaxies of|Ogle| 
[et al. ( 2006 ), which may lack a hot dust torus. 
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Fig. 14. — SED of a "Stern-only" source at Zphot = 1-4, with the|Rieke et al.| 
f2009') template of the purely star-forming ULIRG IRAS 12112-^0305 ovef^ 
plotted (after scaling to match the observed 4.5 fim flux density). Circles give 
the source photometry, and squares give the photometry of the template con- 
volved with the IRAC bandpasses. The apparently red color of this source in 
iSternet al.|^2005j color space can be attributed solely to star-forming features 
(e.g., the 1.6 yum stellar bump and the 3.3 /im PAH emission feature). 



jPonley et al . (2010) DOG sample illustrate that a large frac- 
tion of IR-luminous, potentially Compton-thick AGN can be 
recovered by our IRAC criteria. Nonetheless, many to most 
radio-selected Type 2 luminous AGN lie outside of our selec- 
tion region, depending on how the radio sample is selected. 
This difference may stem fro m the more luminous host galax- 
ies of radio-loud AGN (e.g., LacyetaL 2000). While obscu- 
ration in AGN far more luminous than tkeir hosts will lead to 
a reddening of the IRAC SED similar to that seen in the top 
panel of Figure 13b, obscuration in lower luminosity AGN or 
those with particularly luminous early-type hosts will more 
quickly reveal the stellar continuum, resulting in the curved 
NIR-MIR SEDs shown in the bott om panel o f Figure 13b and 
observed by Don ley et al.,(201Q^ , Haas et al.k 2008 ), and Leip-] 
ski et al. (2010). IRAC selection will thereiore preferentially 
miss AGN with more luminous hosts, particularly when the 
AGN emission is itself obscured. 

9.2.3. \Stem et al\\2005\ -selected Sources 

By de finition, the AGN candid ates occupy a well-defined 
region of [Lacy et akj ( [2Q04| |20Q7] ) IRAC color sp ace (though 
we select only 17% of the sources that fall in the |Lacy et aL] 
(2004, 2007 ) wedge). Wh ile nearly all of our new AGN can- 
didates also He within the [Stem et al.[ ( t200 5 ) AGN selection 
wedge, they comprise only 28% of the sources in the [Stem 
et al.| ( |2005,) wedge and cannot be cleanly separated from the 
remaining galaxies in this representation of IRAC color space. 
To investigate the properties of the sources that do not meet 
our se lection criteria, but that lie well within the jStern et aL] 
( |2005[ ) wedge in regions populated by our AGN candidates, 
we plot in Figure 12 the IRAC colors of all Tx > 50 ks sources 
that do not meet our criteria but that have [3. 6] -[4. 5] (Vega) > 
0.55 and [5.8]-[8.0] (Vega) > 0.85. 

This "Stem-only" sample, which comprises 33% of the 
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sources in this reg ion of IRAC color space (and 11% of the 
[Stern et aT] ( |2005| )- selected sources that do not meet our new 
criteria), has a median redshift of z = 1.5, an X-ray detection 
fraction of only 14%, and median 5.8 and 8.0 fim flux den- 
sities (/5.8/im = 15 /iJy, /s.o/im = 23 /iJy) that are 2-3 times 
fainter than our AGN candidates in this region of color space 
(though no such offset is seen at 3.6 and 4.5 /im). 

While these sources ha ve red 3.6 to 4.5 /j im and 5.8 to 8.0 
fim colors, their colors in |Lacy et al.| ( |2004"] ) color space tend 
to be quite blue (see Figure 12). The reason for this behavior 
is illustrated in Figure 14. While our new AGN selection crite- 
ria require sources to have monotonically rising IRAC SEDs, 
only 15% of the "Stern-only" sample display this behavior. 
Instead, their SEDs tend to rise between 3.6 and 4.5 /im, turn 
over between 4.5 and 5.8 /im, and then rise again towards 
long wavelengths. As shown in Figure 14, at the sample's me- 
dian redshift of z= 1.5, this behavior can be attributed to the 
3.6 and 4.5 /im bands sampling the rising side of the 1.6 /im 
stellar bump, the 5.8 /im band falling redward of the bump, 
and the 8.0 /im band sampling the bright MIR emission from 
the 3.3 /im PAH feature and warm star- formation heated dust. 
Moderate redshift star- forming galaxies like the one plot ted in 
Figure 14 (which were excluded from the original Stern et al.| 
|2005) sample by their shallow flux limit of ^'g^m > 76/iJy) 
will therefore mimic power-law AGN when adjacent wave- 
bands are used to determine their IRAC colors, but can ef- 
fectively be separated from AGN using a longer wavelength 
baseline or a requirement that the SED rise monotonically. 

10. SUMMARY 

IRAC selection provides a powerful tool for identifying lu- 
minous AG N. However , the AG N selection wedg es currently 
in use (e.g., |Lacy et al. 2004; Stern et al.||2005|) are heavily 
contaminated by normal star-forming galaxies in deep IRAC 
data. Using the large samples of luminous AGN and high- 
redshift star-forming galaxies in COSMOS, we redefine the 
IRAC AGN selection criteria for use in deep IRAC surveys. 
The new cuts, presented in §9, incorporate the best aspects 
of the current AGN selection wedges and of infrared power- 
law selectio n ( Alonso-Herrero et al. 2006 ; Donley et al. 2007| 
Park et aL]|2010f , while improving on both. They are de- 
signed to be both highly complete and reliable, and effectively 
exclude high-redshift star-forming galaxies selected via the 
BzK, DRG, LBG, and SMG criteria down to IRAC flux limits 
of 0.9, 1.7, 11.3, and 14.6 /iJy in the 3.6, 4.5, 5.8, and 8.0 /im 
bands, respectively. 

The completeness of the new selection criteria is highly lu- 
minosity dependent. While fewer than 20% of Seyfert galax- 



ies with log L2-iokev(ergs s~^)< 43 meet the IRAC selection 
criteria, the new cuts recover 75% of hard X-ray detected 
AGN with QSO-luminosities of log L2-iokev(ergs s~^)> 44. 
Despite this bias towards luminous AGN, however, only 38% 
of the IRAC-selected AGN candidates in COSMOS would be 
identified as AGN in the X-ray, a fraction that rises to 52% in 
regions with deep Chandra data (T^ = 50-160 ks). 

X-ray stacking of the individually X-ray non-detected AGN 
candidates leads to > 6a detections in both the hard and soft 
X-ray bands, with a large implied column density of log A^h 
(cm~^) ~ 23.5 ± 0.4. In comparison, the hard X-ray selected 
AGN have a typical column density of only log A^H(cm~^)= 
22.4 ± 0.4. While some X-ray non-detected AGN are likely to 
be missed in the X-ray because of their higher typical redshifts 
(z ~ 2.2 compared to z ~ 1.7 for the X-ray AGN), heavily 
obscured to mildly Compton-thick obscuration appears to be 
primarily responsible for driving these intrinsically luminous 
AGN below the current X-ray flux limits. 

The selection criteria defined here provide a reliable method 
for identifying highly complete samples of luminous unob- 
scured and obscured AGN with high-quality (e.g., > 5a) 
counterparts in the four IRAC bands, a condition met by 88% 
of the XMM-COSMOS sample. However, IRAC selection 
cannot effectively identify low-luminosity AGN with host- 
dominated MIR SEDs, and also appears to be incomplete 
to luminous heavily obscured AGN with particularly bright 
hosts (e.g.. Type 2 radio galaxies). X-ray and radio selection 
therefore remain important tools for identifying unobscured 
to moderately obscured low-luminosity AGN and obscured 
radio-loud AGN, respectively. While the upcom ing > 10 keV 
NuSTAR X-ray mission (Harr ison eraL]|2010| ) will begin to 
probe the population of heavily obscured Seyfert-luminosity 
AGN at low to moderate redshift (z < 0.4, e.g., Ballantyne] 
et al. 12011] ), new methods such as composite SED tittmg (e.g. 
A. Del Moro 2012, in prep.) will be required to identify the 
distant radio-quiet and heavily obscured Seyfert-luminosity 
AGN missed by current X-ray, radio, and MIR-based selec- 
tion techniques. 
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APPENDIX 

At high redshift, the observed X-ray bands sample progressively harder X-ray emission less sensiti ve to intrinsic obscuration. 
As a result, low column densities become poorly constrained and are often overestimated (see also |Ueda et ar||2003[ |Akylas| 
et al.|2006| ). To illustrate and quantify this bias, we use a Monte Carlo simulation to study the effect of observational errors on 
the measured column densities of X-ray detected AGN (but do not consider here any additional biases in the Type 2 fraction 
introduced by X-ray detectability as a function of redshift and/or obscuration). 
We start with the intrinsic column density distribution of QSO-luminosity AGN (log Lx(ergs s~^)~ 44) from 



Ueda et al. 



(20031, 



which we parameterize as 500 discrete columns. For redshifts ofz = 0.5-3.0, we then calculate the hard and soft X-ray fluxes that 
would be observed for each of the input columns, and vary these ideal fluxes using random Gaussian errors scaled by the observed 
errors on the X-ray fluxes of the XMM-COSMOS sample. Using these randomized fluxes, we then remeasure the column density 
using the method described in §5.1. The results are shown in Figure 15 a. While low column densities are more poorly constrained 
than high column densities even at low-redshift, very few unobscured AGN (log A^H(cm~^)< 22) will be misclassified as obscured 
AGN (log A^H(cm~^)> 22) at z < 0.5. However, as the redshift increases, progressively larger fractions of intrinsically unobscured 
AGN will scatter into the obscured region of Figure 15a. Because heavily obscured AGN are far less likely to be misclassified as 
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Input log Nh (cm ^) Recovered X-ray Type 2 Fraction 

Fig. 15. — left: Contours representing the agreement between model column densities and those recovered from our Monte Carlo simulation for z = 0.5-3.0. 
Sources with unmeasurably low columns are assigned a value of log A^h = 19. right: Recovered Type 2 fractions for the same range of redshifts, assuming the 
column density distribution of Ueda et al. ( 2003 1 truncated at log A^h = 23. The intrinsic Type 2 fraction of 25% is given by an open star, and the median recovered 
Type 2 fractions are given by tilled stars. Because low column densities have little effect on high X-ray energies, both the measured column densities and the 
X-ray Type 2 fractions will be overestimated at high redshift. 

unobscured, this observational scatter leads to an apparent increase in the Type 2 fraction of high redshift X-ray detected AGN. 

To estimate the effect of this bias on the measured X-ray Type 2 fraction of the hard X-ray detected XMM-COSMOS sample, 
89% of which have log A^H(cm~^)< 23, we first truncate the Ueda et al. (2003 ) column density distribution at log A^H(cm~^)= 23. 
The resulting Type 2 fractions recovered from the Monte Carlo simulation are shown in Figure 15b, as is the intrinsic value of 
25%. For the assumed column density distribution, the recovered X-ray Type 2 fraction is overestimated by only 4% at at z = 1, 
the approximate redshift of the bluest XMM sources in Figure 7. However, at z = 2, the typical redshift of the reddest XMM 
sources, the observed Type 2 fraction is likely to be overestimated by ^ 40%. This overestimation of the X-ray Type 2 fraction 
at high redshift contributes to the apparent discrepancies between X-ray and optically-classified AGN (see §5 and §6). 
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